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I i 

The present invention relates to a gene useful in a process to increase the microbial pro- 

i • 

duction of»carotenoids. 

i 

The carotenoid astaxanthin is distributed in a wide variety of organisms such as animals* 

* f \ I 

algae and microorganisms- It has a strong antioxidation property against reactive oxygen 

species. Astaxanthin is used as a coloring reagent, especially in the industry of farmed fish, 
such as salinoh, because astaxanthin imparls distinctive orange-^ed coloration to the ani- 
mals and contributes to consumer appeal iiv the marketplace. \ 

One of the steps in the carotenogenic pathway of,* e.g. Phaffia rhJdozyma y from a general 
metabolite, acetyl-CoA is the isomerization of isopentenyl pyrophosphate (IPP) to di- 



,1 



methylaryj pyrophosphate (DMAPP) by the action of IPP isomerase. Then, IPP and 

DMAPP are converted to aQo unit, gWanyl pyrophosphate (GP 

: . \ 
densation. • { \ 



In a similar condensation reaction be 



jweeA 



GPP and IPP, GPP h 



>) by the head to tail con- 



converted to Cis unit, 



15 farnesyi pyrophosphate (FPP) which is an important substrate o'f cholesterol in animal and 
ergosterol'in yeast, and of farnesylatic n of jregulation protein such as RAS protein. In 
general, tlxe biosynthesis of GPP and 3 PP from IPP and DMAPP are catalyzed by one en- 
zyme calldd FPP synthase. On the other hand, in prokaryotes sufch as eubacteria, iso- 
pentenyl pyrophosphate was synthesized in a different pathway via l-deoxyxylulose-5- 
20 phosphate from pyruvate which is abient in yeast and animaL Most of the genes involved 
in the meyalonate pathway and FPP synthase gene were cloned from P. rhodozyma. 
(EP 955,363): l ^ i 



HEI/sk 27-09.2002 
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In one aspect, the present invention provides a novel DNA fragment comprising a gene 
encoding the enzyme squalene synthase, i 



i ! 3 

More particularly, the present invention provides a DNA containing regulatory regions, 
such as promoter and terminator, as v^ell as; the open reading frame of squalene synthase 



gene. 



The present invention provides a DNX fragment encoding squalene synthase in Phaffia 
rhodozyma* The said DNA means a cE)NA yvhich contains only open reading frame 
flanked between the short fragments in its 5*- and 3'- untranslated region, and a genomic 
DNA whic|i also contains its regulatory sequences such as its promoter and terminator 
which are necessary for the expressionjof die squalene synthase gene in P. rhodozyma. 

1 



Accordingly, the present invention relates tb a polynucleotide comprising a nucleic acid 
molecule selected from the group consisting ofi 



(a) nucleic acid molecules encoding atjleast ? the mature form of the polypeptide depicted in 
SEQIDNO;3; f j j 

(b) nucleic acid molecules comprisingjthe coding sequence as depicted in SEQ ID NO:2; 

(c) nucleicacid molecules whose nucleotide sequence is degenerate as a result of the 
genetic code to a nucleotide sequence of (a; or (b); 

(d) nucleic acid molecules encoding a polypeptide derived from the polypeptide encoded 
by a polynucleotide of (a) to (c) by way of substitution, deletion and/or addition of one or 
several amino acids of the amino acid -sequence of the polypeptide encoded by a poly- 
nucleotide of (a) to (c); || 



i 



(e) nucleic acicl molecules encoding a polypeptide derived from lie polypeptide whose 
sequence has an identity of 51.3 % or porej to the amino acid sec uence of the polypeptide 
encoded by a nucleic acid molecule of; (a) or (b)j 

(f) nucleic; aci^ molecules comprising ja fragment or a epitope-bearing portion of a poly- 
peptide ericoded by a nucleic acid mo|ecul| of any one of (a) to (je) and having squalene 



sequence of a nucleic acid 



synthase activity; 

(g) nucleic acid molecules comprising a polynucleotide having a 
molecule amplified from Phaffia or Xanthdphylomyces nucleic acid library using the 
primers depicted in SEQ ID NO:4> 5, and 6 

(h) nucleic acid molecules encoding ^polypeptide having squalene synthase activity, 
wherein said polypeptide is a fragment of £ polypeptide encodedjby any one of (a) to (g); 

(i) nucleic acid molecules comprising- at least 15 nucleotides of ajpolynucleotide of any one 
of (a) to (<!);. | I ! 
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(j) nucleic acid, molecules encoding a polypeptide having squalene synthase activity, 
wherein said polypeptide is recognized- by ahtibodies that have been raised against a 
polypeptide encoded by a nucleic acid molecule of any one of (a) ?to (h); 
(k) nucleic acid molecules obtainable by screening an appropriatl library under stringent 

\ i ^ 

conditions with a probe having the sequence of the nucleic acid molecule of any one of (a) 

to (j), and encoding a polypeptide having ajsqualene synthase activity; and 

* \ f 
(1) nucleic acid molecules whose complementary strand hybridizes under stringent con- 

ditions with a nudeic acid molecule of any pne of (a) to (k), and encoding a polypeptide 

having squalene synthase activity. \ 

\, j . . I 

The terms "geneCs)", "polynucleotide",] "nucleic acid sequence", "iiudeotide sequence' 1 , 

"DNA sequence" or "nucleic acid mol£cule([s)" as used herein refers to a polymeric form of 

nucleotides of any length, either ribonucleotides or deoxyribonu£leotides. This term refers 

only to the primary structure of the mblecule. [ 



5 



Thus, this term includes double- and single- stranded DNA, and [RNA. It also includes 

15 known types of modifications, for exainplet methylation, "caps" substitution of one or 
• • I * I ■ 

more of the naturally occurring nucleptides with an analog. Preferably, the DNA sequence 

of the invention comprises a coding sequence encoding the above-defined polypeptide. 
. * • I J 3 

• I * \ 
A "coding sequence" is a nucleotide se<quen?ce which is transcribed into mRNA and/or 

translated into a polypeptide when placed under the control of appropriate regulatory 
20 sequences. The boundaries of the coding sequence are determined by a translation start 
codon at the S'-terminus and a translationUop codon at the 3'-t irminus. A coding 
sequence can include, but is not limited to [mRNA, cDNA, recon binant nucleotide 
sequences or genomic DNA, while intxons|may be present as wefl under certain circum- 
stances. SEQ ID:1 depicts the genomic Dl^fA in which the intron sequence is inserted in 
25 the coding sequence for squalene synt'hasefgene from Phajfia thodozyma. 

In general, the gene consists of several parts which have different functions from each 

other. In eukaryotes, genes which encode corresponding protein, are transcribed to pre- 

! * I * I 

mature messenger RNA (pre-mRNA)| differing from the genes for ribosomal RNA (rRNA), 

small nuclear RNA (snRNA) and transfer RNA (tRNA). Although RNA polymerase II 
30 (PolII) plays a central role in this transcription event, PolII can not solely start transcrip- 
tion without cis element covering an upstream region containing a promoter and an up- 
stream activation sequence (UAS), ar d a tjrans-acting protein factor. At first* a transcrip- 
tion initiation complex which consists of several basic protein components recognize the 
promoter! sequence in the 5*-adjacent region of the gene to be expressed. In this event, 

:. 1 if 
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some additional participants are required in the case of the gene 4hich is expressed under 
. some specific regulation, such as a heat shock response, or adaptation to a nutrition 
starvation, and so on. In such a case, a ! UAS is required to exist in the S'-untranslated 
upstream region around the promoterisequjjnce, and some positive or negative regulator 
proteins recognize and bind to the UAS. The strength of the binding of transcription 
initiation complex to. the promoter sequence is affected by such ajbinding of the trans- 
acting factor around the promoter, and this] enables the regulation of transcription activity. 

1 . ' s 3 ? 

After the activation of a transcription initiation complex by the phosphorylation, a tran- 
scription initiation complex initiates transcription from the transcription start site. Some 
parts of the transcription initiation cojnplek are detached as an elongation complex from 
the promoter region to the 3' directiori of the gene (this step is called as a promoter 
clearance event) and the elongation cqimpl<pc continues the transcription until it reaches to 
a termination sequence that is located jin th<2 3*-adjacent downstrpm region of the gene. 
Pre-mRNA thus generated is modified in nucleus by the addition of cap structure at the 
cap site which!a!most corresponds to the transcription start site, and by the addition of 



poIyA stretches at the polyA signal whjch locates at the 3'-adjacent downstream region. 
Next, intron structures are removed from coding region and exoh parts are combined to 
yield an open reading frame whose sequence corresponds to the primary amino acid 
sequence of a corresponding protein. jThis|modification in which a mature mRNA is 
generated is necessary for a stable gene expression. cDNA in general terms corresponds to 
theDNA sequence which is reverse-transcribed from this mature mRNA sequence. It can 
be synthesized by the reverse transcriptase jderived from viral spelies by using a mature 

mRNA as a template, experimentally. 5 

j | 

To express a gene which was derived from k eukaryote, a procedure in which cDNA is 
cloned into an expression vector for E. colitis often used. This results from a fact that a 
sperificityof intron structure varies ajnong the organisms and an inabUity to recognize the 
intron sequence from other species. I^i faci, a prokaryote has nolintron structure in its 
own genetic background. Even in yeajst, tlje genetic backgroundjis different between 
Ascomycetes to which Saccharomyces cerevisiae belongs and Basidwmycetes to which P. 
rhodozyma belongs, e.g. the intron stmctuje of actin gene from A rhodozyma cannot be 
recognized or spliced by the ascomyc^tous] yeast, Saccharomyces cerevisiae. The intron 
structures of some kinds of the gehes Jappear to be involved in the regulation of the expres- 
sion of their respective gene. It might be important to use a genpmic fragment which has 
. its introns in a case of self-cloning of jthe gine of a interest whose intron structure involves 
35 such a regulation of its own gene expression^ 



To apply a genetic engineering method for a strain improvement study, it is necessary to * 

study its genetic mechanism in the event sufch as transcription and translation. It is 

important to determine a genetic sequence such as its UAS, promoter, intron structure and 

i i J. 
terminator' to study the genetic mechaftism? { 

i I I 

According to this invention, the gene encoding squaiene synthase (SQS) gene from P. 
rhodozyma* including its 5*- and S'-adjicent: regions as well as its intron structure were 

! i J j 1 

determined. j J 

; j , 

The invention' further encompasses polynucleotides that differ from one of the nucleotide 
sequences shown in SEQ ID NO:2 (ancl portions thereof) due to jlegeneracy of the genetic 
code and thus encode a squaiene synthase as that encoded by thejnucleotide sequences 
shown in SEQ ID NO;2. Further the golynudeotide of the invention has a nucleotide 

1 i ^ i 

sequence encoding a protein having an amino acid sequence shown in SEQ ID NO:3. In a 

i i ? i 

still further embodiment, the polynucleotide of the invention encodes a full length Phajfia 

; J i 
rhodozyma protein which is substantially homologous to an amino acid sequence of SEQ . 

IDNO;3. ! i j j 

In addition, it will be appreciated by those -skilled in the art that DNA sequence polymor- 
phism that lead to changes in the amino acid sequences may exist within a population 
(e.g., the R rtiodozyma population). Such genetic polymorphism in the squaiene synthase 
gene may exist among individuals within ajpopulation due to napral variation. 

As used herein, the terms "gene" and '{recombinant gene" refer to nucleic acid molecules 
comprising an open reading frame encoding a squaiene synthase, preferably a squaiene 
synthase from P. rhodozyma. I l 

: . • • | I. 

Such natural variations can typically result- in 1-5% variance in the nucleotide sequence of 

I I ^ i 

the squaiene synthase gene. Any and all such nucleotide variations and resulting amino 

acid polymorphism in squaiene synthase that are the result of natural variation and that do 

not alter the functional activity of sqtjalen^ synthase are intended to be within the scope of 

the invention; (5 | 

Polynucleotides corresponding to natural jj/ariants and non-P. rhodozyma homologues of 
the squaiene synthase cDNA of the invention can be isolated baied on their homology to 
P. rhodozyma* squaiene synthase polynucleotides disclosed herein using the polynucleotide 
of the invention, or a portion thereof; as a^hybridization probe according to standard 
hybridization techniques under stringent hybridization conditions. Accordingly, in 

another embodiment, a polynucleotide ofjthe invention is at least 15 nucleotides in length. 

s £ 

• ( i 

i 1 
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Preferably it hybridizes under stringent conditions to the nucleic jacid molecule comprising 
a nucleotide sequence of the polynucleotide of the present invention, e.g. SEQ ID. NO:2. 
In other embodiments, the nucleic acid is at least 20, 30, 50, 100, 250 or more nucleotides 
in length. The term "hybridizes under) stringent conditions" is defined above and is 
intended to describe conditions for hybridization and washing u&der which nucleotide 
sequences at least 60% identical to each other typically remain hybridized to each other. 

1 1 1 \ i 

Preferably,^ the conditions are such that sequences at least about 65% or 70%, more 
preferably jat least about 75% or 80%, |md even more preferably at least about 85%, 90% or 
95% or more identical to each other typically remain hybridized lo each other- Preferably, 
the polynucleotide of the invention that hybridizes under stringent conditions to a 
sequence of SEQ ID NO:2 corresponds to I naturally occurring nucleic acid molecule. 

In the present invention, the polynudfeotidi sequence includes s!q ID NO:2 and frag- 

\ t \ i 

ments thereofhaving polynucleotide sequences which hybridize to SEQ ID NO:2 under 
stringent conditions which are sufficient to[ identify specific binding to SEQ ID NO:2. For 
example, a!ny combination of the following hybridization and wash conditions may be 
used to achieve the required specific binding: j 
High Stringent Hybridization: 6X SScj; 0.5f/o SDS, 100 ng/ml denatured salmon sperm 
DNA, 50% formamide, incubate overnight with gentle rocking ai 42°C 
High Stringent Wash: 1 wash in 2X SSC, 05% SDS at room temperature for 15 minutes, 
followed by another wash in 0.1X SSG, 0.5% SDS at Room Temperature for 15 minutes. 
Low Stringent Hybridization: 6X SSC 



0.5% SDS, 100 Jig /ml denatured salmon sperm 
DNA, 50% formamide, incubate over light with gentle rocking at 37°C. 

Low Stringent Wash: 1 wash in 0.1X SSC, 0-5% SDS at room terrlperature for 15 minutes. 

i J ( 

Moderately stringent conditions may jbe obtained by varying the] temperature at which the 
hybridizationlreaction occurs and/or r the wash conditions as set forth above. In the 
present invention, it is preferred to use high stringent hybridization and wash conditions 
to define the antisense activity againsl squilene synthase gene from P. rhodozyma. 

•;..! ( ! | 1 

The term ''homology' 1 means that the (respective nucleic acid molecules or encoded pro- 
teins are functionally and/or structurally equivalent. The nucleifc acid molecules that are 
homologous to the nucleic acid molepilesjjdescribed above and that are derivatives of said 
nucleic acid molecules are, for example, variations of said nucleic acid molecules which 
represent modifications having the same biological function, in particular encoding pro- 
teins with! the same or substantially the same biological function*. They may be naturally 
occurring variations, such as sequences from other plant varieties or species, or mutations. 



t 
\ 
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1 I . ! 
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These mutations may occur naturally or maybe obtained by mutagenesis techniques. The 
allelic variations may be naturally occurring allelic variants as well as synthetically pro- 
duced or genetically engineered variants. Structurally equivalents can, for example, 
identified by testing the binding of said polypeptide to antibodies! Structurally equivalent 
have the similar immunological characteristic, e.g. comprise similar epitopes. 

As used herein, a "naturally-occurringj nuc|eic acid molecule refers to an RNA or DNA 
molecule having a nucleotide sequent tha^ occurs in nature (e.gj encodes a natural pro- 
tein). Preferably, the polynucleotide encodes a natural P. rhodozy t ma squalene synthase. 



In addition to naturally-occurring variants jof the squalene synthase sequence that may 
exist in the: population, the skilled artisan will further appreciate fhat changes can be 
introduced by mutation into a nucleotide sequence of the polynucleotide encoding 
squalene synthase, thereby leading to changes in the amino acid Sequence of the encoded 
squalene synthase, without altering the functional ability of the squalene synthase. For 
example, nucleotide substitutions leading to amino acid substitutions at "non-essential" 
amino acid residues can be made in a |equence of the polynucleotide encoding squalene 
synthase, e' g. SEQ ID NO:2. A "non- essential" amino acid residue is a residue that can be 
altered from the wild- type sequence o( onejof the squalene synthase without altering the 
activity of said squalene synthase, whejreas an "essential" amino acid residue is required for 



squalene synthase activity. Other amino acfid residues, however, 



[e.g., those that are not 



conserved or only semi-conserved in the domain having squalene synthase activity) may 



not be essential for activity and thus are likely to bis amenable to 
squalene synthase activity. 



dteration without altering 



Accordingly, the invention relates to polynucleotides encoding squalene synthase that con- 
tain changes in amino acid residues thtat arjp not essential for squalene synthase activity. 
Such squalene synthase differs in. amino acid sequence from a sequence contained in SEQ 
ID NO;3 yet retain the squalene synthase activity described herein. The polynucleotide can 
comprise a nucleotide sequence encoding & polypeptide, wherein the polypeptide com- 
prises an amino acid sequence at least] about 60% identical to an [amino acid sequence of 
SEQ ID NO: 3 and is capable of participation in the synthesis of squalene. Preferably, the 
protein encoded by the nudeic acid nrtolecule is at least about 60r65% identical to the 
sequence in SEQ ID NO:3, more preferably at least about 60-70°|> identical to one of the 
sequences'in SEQ ID NO:3, even more preferably at least about 70-80%, 80- 90%, 90-95% 
homologous to the sequence in SEQ ID NO:3, and most preferably at least abovit 96%, 
97%, 98%, or 99% identical to the sequence in SEQ ID NO:3. j 

? 5 



• To determine the percent homology of tworamino acid sequence*] (e.g., one of the sequen- 
ces ofSEQ ID NO:3 and a mutant form thereof) or of two nucleiq' acids, the sequences are 

I Id 
aligned foroptimal comparison purposes (e.g., gaps can be introduced in the sequence of 

one protein or nucleic acid for optimal alignment with the other protein or nucleic acid). 
5 The aminojacid residues or nucleotides at corresponding amino a*cid positions or nucleo- 
tide positions are then compared. When a position in one sequence (e.g., one of the 
sequences of SEQ ID NO:2 or 3) is occupied by the same amino atid residue or nucleotide 
as the corresponding position in the other sequence (e.g., a mutant form of the sequence 

selected), then the molecules are homqlogoiis at that position (i.ei, as used herein amino 
! . 1 f i j 

10 acid or nucleic acid "homology" is equivalent to amino acid or nucleic acid "identity"). 

The percent homology between the two sequences is a function of the number of identical 

positions shared by the sequences (LeJ % homology = numbers Jf identical positions/total 

numbers of positions x 100), The homology can be determined l>y computer programs as 

Blast 2.0 (Aitschul SF, Nuc. Acid. Res.,>25, 3389-3402, 1997). In this invention* 

15 GENETYX-SV/RC software (Software|Dev|lopment Co., Ltd., Tcjkyo, Japan) is used by 

using its default algorithm as such homology analysis software. This software uses the 

Lipman-Pearson method for its analytic algorithm. 

i i 

A nucleic acidrnolecule encoding a sqpalei|e synthase homologous to a protein sequence 
of SEQ ID jNCte can be created by introducing one or more nucleotide substitutions, 

20 additions or deletions into a nucleotide sequence of the polynucleotide of the present 
invention, in particular of SEQ ID Nd:2 suth that one or more amino acid substitutions, 
additions or deletions are introduced into the encoded protein. Mutations can be intro- 
duced into the sequences of, e.g., SEQJD NO;2 by standard techniques, such as site- 
directed mutagenesis and PGR- mediated mutagenesis. Preferably, conservative amino 

25 acid substitutions are made at one or more, predicted non-essential amino acid residues. A 

"conservative amino acid substitution!' is one in which the amino acid residue is replaced 

with an amino acid residue having a sjmilat side chain. Families (of amino acid residues 

\ a i 
having similar side chains have been defined in the art These families include amino acids 

f i j E 

with basic.side chains (e.g., lysine, arginine*, histtdine), acidic side chains (e.g., aspartic 

30 acid, glutamic acid), uncharged polar jside j:hains (e.g., glycine, aiparagine, glutamine, 
serine, thrbonine, tyrosine, cysteine), honpolar side chains (e.g., alanine, valine, leucine, 
isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side. chains 
(e.g., threonine., valine, isoleucine) ank aromatic side chains (e.gi, tyrosine, phenylalanine, 
tryptophan, histidine). Thus, a predicted nonessential amino acid residue in a squalene 

35 synthase is preferably replaced with ariother amino acid residue from the same family. 

Alternatively/in another embodiment, mutations can be introduced randomly along all or 
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part of a squalene synthase coding sequence, such as by saturation mutagenesis, and the 
resultant mutants can be screened for a squalene synthase activity described herein to 
identify mutants that retain squalene synthase activity. Following mutagenesis of one of 
the sequences of SEQ ID NO:2, the encode^ protein can be expressed recombinantly and 
the activity of the protein can be deterinined using, for example, Issays described herein. 

" * I I I 

Accordingly, ih one preferred embodiment the polynucleotide o^the present invention is 
DNAorRNA. \ 



i • i i ! 



A polynucleotide of the present invention, 'e.g., a nucleic acid molecule having a nucleotide 

sequence of SEQ ID NO:2, or a portioii thereof, can be isolated Jsing standard molecular 

10 biology techniques and the sequence information provided hereik. For example, squalene 

synthase cDNA can be isolated from a^ library using all or portion of one of the sequences 

' • I 
of the polynucleotide of the present invention as a hybridization jprobe and standard 

hybridization techniques. Moreover, k polynucleotide encompassing all or a portion of 

one of the.sequences of the polynucleotidejof the present inventibn can be isolated by the 

15 polymerase ctiain reaction using oligonucleotide primers designed based upon this 

sequence (e.g., a nudeic acid molecule encprnpassing all or a portion of one of the 

sequences jof polynucleotide of the presentiinvention can be isolated by the polymerase 

chain reaction using oligonucleotide primers, e.g. of SEQ ID NCj:4, 5 or 6, designed based 

upon this same sequence of polynucleotide of the present invention. For example, mRNA 

20 can be isolated from cells, e.g. Phaffia l (e.g.\ by the guanidinium-thiocyanate extraction 
; | i I 

procedure! of Chirgwin et al., and cDNA can be prepared using rfeverse transcriptase (e.g., 

Moloney MLV reverse transcriptase or AlviV reverse transcriptase available from Promega 

(Madisoni USA)). Synthetic oligonucleotide primers for polymerase chain reaction 

i .If | 

amplification 1 can be designed based uponjone of the nucleotidefsequences shown in SEQ 

25 ID NO:2. • A polynucleotide of the invention can be amplified using cDNA or, 

alternatively, genomic DNA, as a template* and appropriate oligonucleotide primers 

according to standard PCR amplification techniques. The polynucleotide so amplified can 

be cloned into an appropriate vector knd characterized by DNA sequence analysis 

Furthermore, oligonucleotides corresponding to a squalene synthase nucleotide sequence 

30 can be prepared by standard synthetic techniques, e.g., using an 

1 • . \ i 

synthesizer. • \ I 

: * i 

* I 

The terms "fragment", "fragment of a sequence" or "part of a sequence" means a truncated 
sequence jof the original sequence referred* to. The truncated sequence (nucleic acid or 
protein sequence) can vary widely in;length; the minimum size being a sequence of suf- 
35 ficient size to provide a sequence with at least a comparable function and/or activity of the 

; • ! ( 

IS 



automated DNA 



10 



15 
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original sequence referred to, while the maximum size is not critical. In some applications, 



the maximum size usually is not substantially greater than that rei 
desired activity and/or function(s) of the original sequence. 

i • . ! ! 



4 



uired to provide the 



Typically, the truncated amino acid sequence will range from abo\it 5 to about 60 amino 
acids in length! More typically, however, tlje sequence will be a maximum of about 50 
amino acids in length, preferably a maximum of about 30 amino acids. It is usually desir- 
able to select sequences of at least about 10,512 or 15 amino acidsjup to maximum of 
about 20 or 25 amino acids. i I 



J 
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The term "epitope" relates to specific immtfnoreactive sites within an antigen, also known 
as antigenic determinates. These epitopes can be a linear array of monomers in a poly- 
meric composition - such as amino acids in a protein - or consist! of or comprise a more 
complex secondary or tertiary structure. Those of skill will recoghize that all immunogens 
(L e-> substances capable of eliciting art immune response) are antigens; however, some 
antigen, such as haptens, are not immUnogens but may be made immunogenic by 
coupling to a carrier molecule. The term "antigen" includes references to a substance to 

; { i J 

which an antibody can be generated and/or to which the antibody is specifically 
immunoreactive. j I 

f I 

; 5 

The term "one or several amino acids 1 j relates to at least one amir to acid but not more than 
that numbjer of amino acids which would result in a homology ofrbelow 60% identity. 
20 Preferably, the identity is more than 70% o'r 80%, more preferred are 85%, 90% or 95%, 
even more preferred are 96%, 97%, 98%, or 99% identity. 



25 
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The term "squalene synthase" or "squalenejsynthase activity" relates to enzymatic activities 
of a polypeptide as described below or which can be determined fin enzyme assay method. 
Furthermore, polypeptides that are inactive in an assay herein but are recognized by an 
antibody specifically binding to squalene synthase, Le., having one or more squalene 
synthase epitopes, are also comprised|und|r the term f, squalene i ynthase". In these cases 
activity refers to their immunological|actiyity. 



The terms "polynucleotide" and "nucleic add molecule" also relate to "isolated" poly- 
nucleotides or nucleic acids molecules. An; "isolated" nucleic acid molecule is one which is 

* i 

separated from other nucleic acid molecules which are present in the natural source of the 
nudeic acid- Preferably, an "isolated*! nuc ! e * c ac ^ * s ^ ee °^ sequences which naturally 
flank the nucleic acid (i.e., sequences located at the 5* and 3 f ends of the nucleic acid) in the 
genomic DNA of the organism from jwhich the nucleic add is derived. 



! 
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i I i 
For example, in various embodiments,] the PNO polynucleotide can contain less than 

about 5 kbj 4Kb, 3kb, 2kb, 1 kb, 0.5 lcb bx 0.1 kb of nucleotide sequences which naturally 

flank the nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is 

derived (e,g., zPhaffia cell). Moreover, thejjpolynucleotides of th$ present invention, in 

{ i jj 
5 particular an "isolated" nucleic acid molecule, such as a cDNA molecule, can be substan- 
i * - I 

tialiy free of other cellular material, or culture medium when produced by recombinant 

techniques^ or chemical precursors or Otheii chemicals when chemically synthesized. 

; ;• | | | 

Preferably,:the;polypeptide of the invention comprises one of thelnudeotide sequences 

: { i { 

shown in SEQ ID NO:2. The sequence of SEQ ID NO:2 corresponds to the P. rhodozyma 

10 squalene synthase cDNAs of the invention. \ 



Further, the polynucleotide of the invention comprises a nucleic ficid molecule which is a 
complement of one of the nucleotide sequences of above mentioned polynucleotides or a 
portion thereof A nucleic acid molecule which is complementary to one of the nucleotide 
. sequences shown in SEQ ID NO:2 is ope wfiich is sufficiently complementary to one of the 
15 nucleotide!sequences shown in SEQ ID NC^:2 such that it can hyljridfee to one of the 
nucleotide sequences shown in SEQ ID Nd:2, thereby forming a stable duplex. 

The polynucleotide of the invention comprises a nucleotide sequence which is at least 
about 60%, preferably at least about 65-70%, more preferably at least about 70-80%, 80- 
90%, or 90-95%, and even more preferably; at least about 95%, 96%, 97%, 98%, 99% or 
20 more homblogous to a nucleotide sequence shown in SEQ ID N(p:2, or a portion thereof. 
The polynucleotide of the invention comprises a nucleotide sequence which hybridizes, 
e.g., hybridizes under stringent conditions ks defined herein, to dne of the nucleotide 
sequences shown in SEQ ID NO:2, or;a portion thereof. ( 



Moreover,' the polynucleotide of the invention can comprise only a portion of the coding 

25 region of one of the sequences in SEQ; ID NO:2, for example a fragment which can be used 

as a probe'or primer or a fragment encoding a biologically active] portion of a squalene 

synthase. The nucleotide sequences determined from the cloning of the squalene synthase 

gene from P. rhodossyma allows for the generation of probes and primers designed for use 

\ I * 
in identifying and/or cloning squalene synthase homologues in other cell types and 

30 organisms'. The probe/primer typically comprises substantially purified oligonucleotide. 

The oligonucleotide typically comprises a region of nucleotide sequence that hybridizes 

under stringent conditions to at least 'about 12, 15 preferably about 20 or 25, morepre- 

ferably abbut'40, 50 or 75 consecutive nucleotides of a sense strand of one of the sequences 

i ' i I "J 

set forth, e.g.,. in SEQ ID NO:2, an anti-sense Sequence of one ofjthe sequences, e.g., set 

M ! 

i \ i . 
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forth in SEQ ID NO:2, or naturally occurring mutants thereof* Piimers based on a nucleo 
tide of invention can be used in PGR rfeactipns to clone squalene synthase homologues. 
Probes based on the squalene synthase nucleotide sequences can be used to detect tran- 
scripts or genomic sequences encoding the fcame or homologous proteins. The probe can 
further comprise a label group attached thereto, e.g. the label grojip can be a radioisotope, 

a fluorescent compound, an enzyme, or anfenzyme co-factor. Such probes can be used as 

• | 5 ' £ 

a part of a genomic marker test kit forjdentifying cells which express a squalene synthase, 

such as by measuring a level of a squalene synthase-encoding nucleic acid molecule in a 

sample of cells, e.g., detecting squalen* synjhase mRNA levels or (determining whether a 

genomic squalene synthase gene has been mutated or deleted. 

:. i . ! 

The polynucleotide of the invention encodes a polypeptide or portion thereof which in- 
cludes an amino acid sequence which is sufficiently homologous|to an amino acid 
sequence of SEQ ID NO:3 such that the protein or portion thereof maintains the ability to 
participate in the synthesis of squalene, in particular a squalene synthase activity as 
described in the examples in microorganisms or plants. As used 'herein, the language 
"sufficiendy homologous" refers to protein** or portions thereof ^hich have amino acid 
sequences which include a minimum number of identical or equivalent (e.g., an amino 
acid residue which has a similar side chainJas an amino acid residue in one.of the sequen- 
ces of the polypeptide of the present invention amino acid residues to an amino acid 
sequence of SEQ ID NO:3 such that tte protein or portion thereof is able to participate in 
the synthesis of squalene in microorganisms or plants. Example! of a squalene synthase 

j * { • 5 I 

activity are also described herein. J \ ? 

The protein is at least about 60-65%, £refc?rably at least about 66>-70%, and more prefer- 
ably at leapt about 70- 80%, 80-90%, ?0-9$%, and most preferably at least about 96%, 
97%, 98%!, 99% or more homologous^ to an entire amino acid sequence of SEQ ID NO:3. 

i I 

Portions of proteins encoded by the s ? quallne synthase polynucleotide of the invention are 
preferably biologically active portions of one of the squalene synthase. 

i ! S 

As mentioned herein, the term biologically active portion of squalene synthase" is in- 

'■ 19 \ 

tended tojincliide a portion, e.g., a dcimaiii/motif, that participates in the biosynthesis of 
squalene or has an immunological activity* such that it is binds to an antibody binding 
specifically to squalene synthase. To deterfnine whether a squalene synthase or a biologi- 
cally active portion thereof can participate in the metabolism an assay of enzymatic activity 
may be performed. Such assay methods are well known to thos| skilled in the art, as 
detailed in . the Examples. Additional; nucleic acid fragments encoding biologically active 

: M ! 



i ! 
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portions of a squalene synthase can be [prepared by isolating a portion of one of the 
sequences in SEQ ID NO:2, expressing the encoded portion of the squalene synthase or 
peptide (e.g., by recombinant expression in^vitro) and assessing the activity of the. encoded 
portion of the squalene synthase or peptide} 

• i I I 

5 At first, a partial gene fragment containing !a portion of SQS gene by using degenerate PGR 

method was cloned. The said degenerate PCR is a method to clone a gene of interest 

which has high homology of amino acid sequence to the known enzyme from other species 

which has a saine or similar function, (Degenerate primer, whichjis used as a primer in 

degenerate PCR, was designed by a reyerse translation of the amino acid sequence to 

10 corresponding nucleotides ("degenerated")- In such a degenerate primer, a mixed primer 

which consists any of A, C, G or T, or a priper containing inosuie at an ambiguity code is 

generally used. In this invention, such the mixed primers were used for degenerate 

primers ta clone above gene. j jj 

. ;' . . Li i 

An entire gene containing its coding region with its intron as well as its regulation region 
j i * h | 

15 such as a promoter or a terminator can be Honed from a chromosome by screening of 

1 I \ 
genomic library which is constructed in phage vector or plasmidjvector in appropriate 

host, by using a partial DNA fragment obtained by degenerate PCR as described above as a 

i 1 * I 

probe after it was labeled. Generally, R coli as a host strain and R coli vector, a phage 

if 5 
vector such as X phage vector, or a plasmid vector such as pUC vector is often used in the 

* & * 
20 construction of library and a following genetic manipulation such as a sequencing, a re- 

* r 1 f * 

striction digestion, a ligation and the like. In this invention, an EcoRl genomic library of P. 
rhodozyma was constructed in the derivatives of X vector, XDASHII. An insert size, what 

I i j 

length of insert must be cloned, was determined by the Southern blot hybridization for the 

t j * | . 

gene before a construction of a library. In(this invention, a DNA used for a probe was 

25 labeled with digoxigenin (DIG), a steroid hapten instead of conventional 32 P label, follow- 

ing the protocol which was prepared by the supplier (Boehringelr-Mannheim, Mannheim, 

, t | * { 

Germany). A genomic library constructed from the chromosome of P. rhodozyma was 

t if J 

screened by using a DIG-labeled DN£ fragment which had a portion of a gene of interest 
as a probe. Hybridized plaques were pick|d up and used for further study. In the case of 
30 using XDASHII (insert size was from|9 kbjto 23 kb), prepared XpNA was digested by the 

EcoRl, followed by the cloning of theiBcoI^I insert into a plasmid vector such as pUC19 or 

[ t i 
pBluescriptll SK-K A plasmid DNA thus obtained was examined for its sequence. 
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In this invention, we used the automated fluorescent DNA sequencer, ALFred system 
(Pharmacia, Uppsala, Sweden) using an autocycle sequencing protocol in which the Taq 
DNA polymerase is employed in most casei of sequencing. 

i ' t I 

After the determination of the genomic sequence, a sequence of ajc 



A . ^ - - - - -jcoding region was used 
for a cloning of cDNA of corresponding gerte. The PCR method was also exploited to 
clone cDNA fragment- The PCR primfers wjhose sequences were identical to the sequence 
at the 5'- and 3*- end of the open reading frlame (ORF) were synthesized with an addition 
of an appropriate, restriction site, and PCR was performed by usiijg those PCR primers. In 
this invention, a cDNA pool was used as a template in this PCR cloning of cDNA. The 
10 said cDNApool consists of various cD.NA species which were synthesized in vitro by the 
viral reverse transcriptase and Taq polymerase (CapFinder Kit manufactured by Clontech, 
Palo Alto, U.S-A.) by using the mRNAiobtained from P. rhodozyma as a template. cDNA 
of interest thus obtained was confirmed in its sequence. > 



i I 
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In another'embodiment, the present invention relates to a method for making a recombi- 

15 nant vector comprising inserting a polynucleotide of the invention into a vector. 

Further, the present invention relates io a recombinant vector containing the polynucleo- 
tide of the invention or produced by said method of the invention. 

As used herein, the term "vector" referk to a nucleic acid molecule capable of transporting a 

polynucleotide to which it has been linked^ One type of vector isja "plasmid' 1 , which refers 

20 to a circular double stranded DNA lodp info which additional D^IA segments can be 

if | 
ligated. Another type of vector is a viral vector, wherein additional DNA or PNA segments 

can be ligated into the viral genome, pertain vectors are capable |of autonomous replica- 
tion in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial 
origin of replication and episomal mammalian vectors). Other \rectors (e.g., non-episomal 

25 mammalian vectors) are integrated into the genome of a host cell upon introduction into 
the host cell, and thereby are replicated alopg with the host genome. Moreover, certain 
vectors are capable of directing the expression of genes to which ihey are operatively 
linked. Such vectors are referred to herein as "expression vectorsf . In general, expression, 
vectors of utility in recombinant DN/l techniques are often in th'e form of plasmids. In the 

30 present specification, "plasmid" and "vector" can be used interchangeably as the plasmid is 
the most commonly used form of vector, however, the invention is intended to include 
such other forms of expression vectors, such as viral vectors (e.gl replication defective 
retroviruses, adenoviruses and adenoj-assofciated viruses), which|serve equivalent 



functions. \ 

: , i j 
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The present invention also relates to cdsmids* viruses, bacteriophages and other vectors 

• * I * \ 

used conventionally in genetic engineering that contain a nucleic acid molecule according 

» i j i 

to the invention. Methods which are well khown to those skilled in the art can be used to 

construct various plasmids and vector^- Alternatively, the nucleic acid molecules and 
5 vectors of the invention can be reconstitute!! into liposomes for delivery to target cells. 

I I I 

In an other, preferred embodiment to present invention relates to^a vector in which the 
polynucleotide of the present invention is dperatively linked to expression control sequen- 
ces allowing expression in prokaryoticjor eiikaryotic host cells. The nature of such control 
sequences differs depending upon the Jiost prganism. In prokary6tes, control sequences 

10 generally include promoter, ribosomal binding site, and terminators. In eukaryotes, 
; } j « 

generally control sequences include prpmoters, terminators and, jjun some instances, en- 
hancers, transactivators; or transcription factors. I 

5 r I 
\ ; | f I 

The term "control sequence" is intended to|include, at a minimum, components the pre- 
sence of which are necessary for expression, and may also include additional advantageous 



15 components. » ! ( 

The term "operably linked" refers to a Juxtaposition wherein the components so described 
are in a relationship permitting them to function in their intended manner. A control 
sequence "operably linked" to a coding sequence is ligated in sucli a way that expression of 
the coding sequence is achieved underj conditions compatible with the control sequences. 
20 In case the control sequence is a pronfoter,jit is obvious for a skilled person that double- 
stranded nucleic acid is used, ! 8 j 

: i f ! ! 

' i * 
Such regulatory sequences are are known tp the skilled person- Regulatory sequences in- 

- elude those which direct constitutive expression of a nucleotide sequence in many types of 

e * I 

host cell and those which direct expression] of the nucleotide sequence only in certain host 
25 cells or under jcertain conditions. It w;ill be appreciated by those skilled in the art that the 
design of the expression vector can d^pen<j on such factors as the choice of the host cell to 
be transformed, the level of expression of protein desired, etc. Tne expression vectors of 
the invention |can be introduced into host tells to thereby produce proteins or peptides, 
including fusion proteins or peptides,' encoded by polynucleotides as described herein. 

s j i 

30 The recombinant expression vectors 6f the invention can be designed for expression of 
squalene synthase in prokaryotic or eiikarf otic cells. For example, genes encoding the 
polynucleotide of the invention can be expressed in bacterial cells such as JL coli, insect 
cells (using baculovirus expression vectors), yeast and other fun jal cells, algae, ciliates of 
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the types: Holofrichia, Peritrichia, SpiroWichia, Suctoria, Tetrahymqna, Paramecium, 
ColpidiuniyGlaucomay Platyophrya, Potpmacus, P$eudocohnilembus> Euplotes, Engelmaniella, 
and Stylonychia, especially of Stylortychta lemnae with vectors following, a transformation 
method as described in WO 98/01,572 jand multicellular plant cells or mammalian cells. 
5 Suitable host cells are known to the skilled person. Alternatively, the recombinant 
expression yector can be transcribed and translated in vitro, for eiample using T7 
promoter regulatory sequences and T7 5 polymerase, i 

1 ,v i i I 

Expressionlof proteins in prokaryotes is mqst often carried out with vectors containing 
constitutive or, inducible promoters directing the expression of either fusion or non-fusion 
proteins. Fusion vectors add a number of amino acids to a protejn encoded therein, 
usually to the amino terminus of the recombinant protein but also to the C-terminus or 
fused within suitable regions in the proteins. Such fusion vectors jjtypically serve three pur- 
poses: 1) td increase expression of recombinant protein; 2) to increase the solubility of the 



10 



15 



recombinant protein; and 3) to aid in the purification of the recombinant protein by 



acting as a ligand in affinity purification. Often, in fusion expression vectors, a proteolytic 
cleavage site is'introduced at the junction of the fusion moiety anjd the recombinant pro- 



tein to enable separation of the recombinant protein from the fusion moiety subsequent 1 
purification of the fusion protein. Such enzymes, and their cognate recognition sequences, 



i 

include Factor Xa, thrombin and enter okiriase. 

. , if 



i 
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20 . Typical fusion* expression vectors include p.GEX (Pharmacia Biotech Inc.), pMAL (New 



England Biolabs, Beverly, USA) and pRIT5j (Pharmacia, Piscatawky, USA) which fuse 
glutathione S-transferase (GST), maltpse E;binding protein, or p: otein A, respectively, to 
the target recombinant protein. In one embodiment, the coding sequence of the poly- 
peptide encoded by the polynucleotide of the present invention i i cloned into a pGEX 
expression vector to create a vector encoding a fusion protein comprising, from the N- 
terminus to the C-terminus, GST-thrombiji cleavage site-X protein. The fusion protein 
can be purified by affinity chromatography using glutathione-agarose resin, e.g. recombi- 
nant squalene synthase unfused to GST can be recovered by cleavage of the fusion protein 



25 



with thrombin. 
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Examples of suitable inducible non-fusion[E coli expression vectors include pTrc and pET 

lid. Target gene expression from thejpTrc vector relies on host RNA polymerase tran- 

If I 
scription from a hybrid trp-lac fusion; promoter. Target gene expression from the pET lid 

! * » $ i 

vector relies on transcription from a T7 gnJO-lac fusion promoter mediated by a co- 
expressed .viral RNA polymerase (T7 gnl). This viral polymerase is supplied by host strains 

. : I I ! 



} 20:6 AS'/( 7 J!"S8ue/d»3 



! 417- } 

! : i ! ! 

BL21(DE3). or HMS174(DE3) from a resident X prophage harboring a T7 gnl gene under 

i i \ 
the transcriptional control of the lacXJV 5 promoter. \ 

i 1 i t 

; ■ It 

One strategy to maximize recombinant protein expression is to express the protein in host 
bacteria with an impaired capacity to F>rote|>lytically cleave the recombinant protein. An- 

5 other strategy is to alter the nucleic acid sequence of the nucleic a£id to be inserted into an 

* ** \ 
expression vector so that the individual codbns for each amino acid are those preferentially 

utilized in the bacterium chosen for expression, such as £ coli. Silch alteration of nucleic 

.' \ * \ 

acid sequences of the invention can bejcarried out by standard DN A synthesis techniques. 

I M i 

Further, the squalene synthase vector can be a yeastexpression vector. Examples of vectors 

10 for expression in yeast S. cerevisiae include pYepSecl, pMFa* pJRY88, and pYES2 (Invitro- 

l '* \ 
gen, San Diego, USA). Vectors and methods for the construction pi vectors appropriate 

! j i } 

for use in other fungi, such as the filamentous fungi, are known to the skflled person. 

! :. ! ! j 

Alternatively, thepolynucleotide of the invention can be introduced in insect cells using 
bacuiovirus expression vectors. Baculdvirus vectors available for Ixpression of proteins in 
15 cultured insect cells (e.g., Sf 9 cells) inelude$the pAc series and the pVL series. 

: : ! I j 

Alternatively, the polynucleotide of thfe invention is introduced in mammalian cells using a 
mammalian expression vector. Examples cif mammalian expression vectors include 
pCDM8 and pMT2PC. When used in mammalian cells, the expression vectors control 
functions are often provided by viral regulatory elements. For example, commonly used 
20 promoters are derived from polyoma,lAdenoviru$ 2, cytomegalovirus and Simian Virus 
40. Other Suitable expression systems; for both prokaryotic and eukaryotic cells are known 
to the skilled person, I I } 

! I I 

The recombinant mammalian expression vector can be capable qf directing expression of 

i i j j 

the nucleic acid preferentially in a particular cell type (e,g., tissue-specific regulatory eie- 

25 ments are used to express the nucleic acid>. Tissue- specific regitiatory elements are 

i ^ • 
known in the art Non-limiting examples df suitable tissue-specific promoters include the 

albumin promoter (liver- specific), lymphoid- specific promoters, in particular promoters 

of T cell receptors and immunoglobulins, *neuron-specific promoters (e.g., the 

neurofilament promoter), pancreas-specific promoters, and mammary gland-specific 

30 promoters (e.g,, milk whey promoter; US 4,873, 316 and EP 264|l66). Developmentally- 

regulated promoters are also encompassed, for example the murine hox promoters and the 

fetoprotein promoter. ! | 
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enzyme assay method, 
following is the one of 



Thus expressed SQS gene can be verified for its activity such 
Some experimental protocols are descnbedlin the literature. The 
the methods which is used for the determination of squalene synthase activity: squalene 
synthase activities are determined by monitoring the conversion of [I- 3 H] FPP into 
5 squalene. Reaction mixtures (500 ml) includes 50 mM Tris-HCl (pH 7.4), 2 mM KF, 1 
mM MgCl 2 , 1 niM NADPH, enzyme, f 0 mM [l- 3 H]FPP (370 MBq/mmol, 3.7 kBq/mL; 
New England Nuclear, Boston, MA). Reactions are started by adding [l- 3 H] FPP. After a 
10-min incubation at 37 °C, reactions ;are terminated by adding 1< mL of ethanoL After 1 

! • 1 1 

mL of H2P is added, the mixtures are vigorously shaken with 3 mL of petroleum ether for 
10 30 min. Extracted lipids are evaporated and resuspended in 25 nil of chloroform. Samples 

are applied to plastic-backed sheets (Silica gel 60, F254; MeTck, Rkhway, NJ) for thin-layer 

* I j I 

chromatography (TLC), and developed in heptane for 15 min. Radioactivities included in 

• , j f i 

the squalene fraction are measured by : liquid scintillation counting. When expression 

vector for S. cerevisiae is used, a complementation analysis can be conveniently exploited 
15 by using conditional squalene synthase mutant ERG 9 strain deri/ed from S. cerevisiae as a 
host strain 2 for, its confirmation of activity (Werkulov et at, Yeast 16, 197-206, 2000). 

Succeeding to the confirmation of thejenzjjjme activity, an expressed protein would be 
purified and used for raising of the antibody against the purifiedfenzyme* Antibody thus 
prepared would be used for a characterization of the expression of the corresponding 
20 enzyme in* a sfrain improvement study, an [optimization study ofithe culture condition, 
and the like. \ 1 



30 



In a further embodiment, the present ^invention relates to an antibody that binds 
specifically to the polypeptide of the present invention or parts, £e. specific fragments or . 

« i f 

epitopes of such a protein. ! \ 



25 The antibodies of the invention can be used to identify and isolate other squalene synthase 
and genes! These antibodies can be monoclonal antibodies, polyclonal antibodies or syn- 

Fv or scFv fragments etc. 



thetic antibodies as well as fragments jof antibodies, such as Fab, 



Monoclonal Antibodies can be prepared, for example, by techniques known to the skilled 
person, which comprise the fusion ofjmouse myeloma cells to sp ? ieen cells derived from 

immunized mammals. \ j 

1 j. j 

Furthermore, antibodies or fragment^ thereof to the aforementioned peptides can be ob- 
tained by using methods which aretapiowp to the skilled person. These antibodies can be 
used, for example, for the immunoprecipitation and immunolo :alization of proteins 
according to the invention as well as for tlie monitoring of the synthesis of such proteins, 



W 

: I ! . 
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! I I 
for example, in recombinant organising ana for the identification! of compounds inter- 
acting with- the protein according to the invention- For example, surface plasmon 
resonance as employed in the BlAcore systepn can be used to increase the efficiency of 
phage antibodies selections, yielding a highiincrement of affinity from a single library of 
5 phage antibodies which bind to an epitope of the protein of the invention. In many cases, 
the binding phenomenon of antibodies to antigens is equivalent to other ligand/anti- 
ligahd binding. 5 i 

j j 

In this invention, the gene fragment for squalene synthase was cloned from P. rhodozyma 
with a purpose to decrease its expression level in P. rhodozyma by$ genetic method using the 
10 cloned gene fragment. j \ 



The present invention provides a process for the production of c4rotenoids wherein a gene 
encoding squalene synthase is modified in a suitable host, such a^ P. rhodozyma to decrease 
its expression, and cultivation of such a trajisfbrihant in an appropriate medium under 
appropriate cultivation conditions. | J J 

15 To decrease a gene expression with genetic [methods, some strategies can be employed. 

' ' i I ' 

One of which is a gene-disruption method.! In this method, a partial fragment of the ob- 

is l 
jective gene to be disrupted is ligated tp a drug resistant cassette on the integration vector 

which can hotreplicate in the host organism. A drug resistance gene which encodes the . 

enzyme that enables the host to survive in the presence of a toxicjantibiotic is often used 

20 for the selectable marker. G4rl8 resistance gene harbored in pGB;-Ph9 is an example of a 

i i \ J 

drug resistance gene which functions in P. rhodozyma. Nutrition^ complementation 
marker can be also used in the host which has an appropriate au#otrophy marker. P. 
rhodozyma ATCC24221 strain that require! cytidine for its growth is one example of the 
auxotrophl By using CTP synthetase £s donor DMA for ATCC2422I, a host vector system 
25 using a nutrition complementation can be [established. j 

i ! i 

After the transformation of the host organisms and recombination between the objective 

gene fragment on the vector and its corresponding gene fragment on the chromosome of 

the host organisms, the integration vector is integrated onto the fiost chromosome by 

it \ 
single cross recombination. As a result of this recombination, tKe drug resistant cassette 

» ; j * 

30 would be inserted in the objective gene whbse translated product is only synthesized in its 

1 i 4 * i 

truncated 'form which does not have its enzymatic function. In a similar manner, two 
parts of the objective gene were also used for gene disruption study in which the drug 
resistant gene can be inserted betweeA such two partial fragments of the objective genes on 
the integration vector. In the case of this tjype of vector, double recombination event 

II 1 
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between the gene fragments harbored on the integration vector and the corresponding 
gene fragments on the chromosome of.the ljtost are expected Although frequency of this 
double crossing-over recombination isjlower than single cross recombination, null 
phenotype of the objective gene by the jdouble cross recombination is more stable than by 

the single cross recombination. I I I 

i it 

) ? 5 

This strategy was used to construct thejlycopene-producing recombinant of Candida utilis 
which harbored bacterial carotenogenic genes on the plasmid (Shimada et al> (Applied 
and Environmental Microbiology, 64 (7), 2&76-2680, 1998)). £U&9 gene encoding 
squaiene synthase was cloned from G ytilisimd its gene disruptailt was induced after 
double cross recombination of ERG9 gene On the chromosome ofthelycopene-producing 
C. utilis. Shimada et al reported that the disruption of BRG9 gene gave a positive effect on 
carotenogenesis by the recombinant d utilis especially derived from the host in which 3- 
hydroxy methyiglutaryl-CoA reductase was amplified on the ribofeomal DNA locus multi- 
copied on the chromosome of the host. i \ 

'■ > If 

15 On the other hand, this strategy has difficulty in the case of the gene whose function is 
essential and disruption is lethal for tlve host organism such as sq lalene synthase gene. In 
the above reference (Shimada et al,), the disruption was made on the either copy of the 
squaiene synthase gene within the two)copi s es of those on the host chromosome. In such a 
construction, it was not confirmed that thej decreased level of thejsqualene synthase activity 

20 was sufficient to increase the carbon fljux into the carotenoid pathway. 

; : ) { I 

In such a case,* other strategies can be applied to decrease (not to jdisrupt) a gene expres- 
sion. One.of which is a conventional mutagenesis to screen the mutant whose expression 
for squaiene synthase is decreased. In|this method, an appropriate recombinant in which 
an appropriate reporter gene is fused ^o the promoter region of sjqualene synthase gene 
25 from the host organism is mutated and mutants which show a wfeaker activity of reporter 
gene product can be screened. In such mutants, it is expected that their expression of 
squaiene synthase activity decreased Vy the mutation lying in thejpromoter region of re- 
porter gene or trans-acting region which might affect the expression of squaiene synthase 
gene other than the mutation lying injthe promoter gene itself. In the case of mutation 

30 occurring at the promoter region of the reporter fusion, such mutation can be isolated by 
i 5 i i 

• the sequence of the corresponding region.] Thus isolated mutation can be introduced in a 
i t • I 

variety of carotenoids, especially asta:pntfcin producing mutants derived from P. rhodo- 

zyma by a'recombination between the original promoter for squklene synthase gene on the 

chromosome and the mutated promdter fragment. To exclude imitations occurring at 

35 trans-acting region, a mutation can also be induced by an in vitro mutagenesis of a cis 

i ! \ 
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element in the promoter region. In this approach, a gene cassette} containing a reporter 
gene which is fused to a promoter region derived from a gene of interest at its 5*-end and a 
terminator region from a gene of interest at; its 3'-end, is mutageriized and then introduced 
into P. rhodozyma. By detecting the difference of the activity of tlie reporter gene, an eflfec- 

tive mutatibn can be screened. Such aimutation can be introduced in the sequence of the 

' i I 
native promoter region on the chromosome by the same method *as the case of an in vivo 

mutation approach. But, these methods have some drawbacks to, 

suming process. j | 

Another strategy to decrease a gene expression is an antisense method. This method is 
frequently applied to decrease the gene expression even when telejomorphic organisms 
such as P. tfiodozyma are used as host organisms, to which the imitation and gene disrup- 

tion method is usually difficult to be applied. The anti-sense method is a method to de- 

\ * i * t 

crease an expression of gene of interest by introducing an artificial gene fragment, whose 

? - i \ 

sequence is complementary to cDNA fragment of the gene of interest. 



have some time-con- 



An 'antisense" nucleic acid molecule comprises a nucleotide sequence which is comple- 
mentary to a "sense" nucleic acid molecule ^encoding a protein, e.jg., complementary to the 
coding strand of a double-stranded cDNA molecule or complementary to a mRNA 
sequence. Accordingly, an antisense nucleic acid molecule can hydrogen bond to a sense 
nucleic acid molecule. The antisense nucleic acid molecule can Be complementary to an 
entire squalene synthase-coding Strang, or jto only a portion thereof. Accordingly, an anti- 
sense nucleic acid molecule can be antisense to a "coding region'! of the coding strand of a 
nucleotide sequence encoding a squalene synthase. The term "coding region" refers to the 
region of the nucleotide sequence comprising codons which are translated into amino acid 
residues. Further, the antisense nucleic acfd molecule is antisens'e to a "noncoding region" 
of the coding strand of a nucleotide sequence encoding squalenejsynthase. The term "non- 
coding region" refers to 5' and 3' sequences which flank the coding region that are not 
translated into a polypeptide (i.e., also referred to as 5' and 3' mistranslated regions). 

if I 
Given the coding strand sequences encoding squalene synthase disclosed herein, antisense 

nucleic acid molecules of the invention can be designed according to the rules of Watson 
and Crick- base pairing. The antisens& nucleic acid molecule canlbe complementary to the 
entire coding.region of squalene synthase mRNA, but can also be an oligonucleotide which 
is antisense to only a portion of the coding or noncoding region! of squalene synthase 
mRNA. For example, the antisense oligonucleotide can be complementary to the region 
surrounding the translation start sitejof squalene synthase mRNk. An antisense oligo- 
nucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in 

: ': . j ! I 
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length. An antisense nucleic acid molelcule of the invention can bje constructed using 
chemical synthesis and enzymatic ligation reactions using procedures known. in the art 
For example, an antisense nucleic acid 'molecule (e.g., an antisensl oligonucleotide) can be 
chemically synthesized using naturallyioccurring nucleotides or v&riously modified 
5 nucleotides designed to increase the biplogibai stability of the molecules or to increase the 
physical stability of the duplex formed jbetween the antisense and|sense nucleic acids, e.g., 
phosphorothioate derivatives and acridine substituted nucleotide? can be used. Examples 
of modified nucleotides which can be used \o generate the an ti- sense nucleic acid include 
5-fluorouTacii, 5-bromouracil, S-chlorjourafil, 5-iodouracil, hypoxanthine, xanthine, 
10 4-acetylcytpsine/5-(carboxyhydroxylmethxl) uracil, 5-carboxymethylaminomethyl- 
2-thiouridine, 5-carboxyiiiethylaminojnethLyluracii, dihydrouracil, beta-D-galactosyl- 
queosine, inosine, N6-isopentenyladenine,|l-methylguanine, 1-methylinosine, 2,2-di- 
methylguahine, 2-methyiadenine, 2-methylguanine, 3-methylcytosine, 5-methykytosine, 
N6-adenine, 7-methylguanine, 5-metliylaniinomethyluracil, 5-mLthoxyaminornethyl- 
15 2-thiouracili beta-D-mannosylqueosirie, S^methoxycarboxymethyluracil, 5- 

methoxyufacil, 2-methylthio-N6-isopenteriyladenine, uracil-5-oiyacetic acid (v), 
wybutoxosine, pseudouracil, queosine?, 2-tlliocytosine, S-methyl^-thiouracil, 2- 
thiouracil,!4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid niethylester, uracil-5- 
oxyacetic acid (v), 5-methyl-2-thiouracil, 3^(3-amino-3-N-2-carioxypropyl) uracil, 
20 (acp3)w, and 2,6-diaminopurine. Alternatively, the antisense nucleic acid can be 

produced biologically using an expression yector into which a polynucleotide has been 
subcloned in an antisense orientation«(Le.,[RNA transcribed from the inserted 
polynucleotide will be of an antisense jorientation to a target polynucleotide of interest, 

described further in the following subsection). 

« * * j £ 

: • | 

25 The antisense nucleic acid molecules pf the invention are typically administered to a cell or 
generated in situ such that they hybridize with or bind to cellulai mRNA and/or genomic 
DNA encoding a squalene synthase to' thereby inhibit expression! of the protein, e.g., by in- 
hibiting transcription and/or translation. (The hybridization caJbe by conventional 
nucleotide complementarity to form a stable duplex, or, for example, in the case of an 
30 antisense nucleic acid molecule whicti binds to DNA duplexes, through specific • 
interactions in the majoT groove of the double helix. The anti-sense molecule can be 
modified $uch that it specifically binds to | receptor Or an antigen expressed on a selected 
cell surface, e.g., by linking the antisense nucleic acid molecule to a peptide or an antibody 
which binds to a cell surface receptorjor antigen. The antisense nucleic acid molecule can 
also be delivered to cells using the vectors 'described herein. To achieve sufficient 

f v r 

intracellular concentrations of the antisense molecules, vector constructs in which the 

" ! : I I 
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antisense nucleic acid molecule is placed under the control of a strong prokaryotic, viral, 
or eukaryotic including plant promoters ar4 preferred. 

Further embodiment, the antisense nucleic acid molecule of the invention can be an 

: i i | 

CC-anomeric nucleic acid molecule. An; (X-anomeric nucleic acid molecule forms specific 

5 double-stranded hybrids with complementary RNA in which, contrary to the usual 

(3-units, the strands run parallel to each other. The antisense nucleic acid molecule can also 
comprise a^'-o-methylribonucleotide or a chimeric RNA-DNA analogue. 



Further the antisense nucleic acid molecule-of the invention can Be a ribozyme. Ribozymes 

If j 
are catalytic RNA molecules with ribonuclease activity which are 'capable of cleaving a 

10 single-stranded nucleic acid, such as a pRl^A, to which they have a complementary 

region. Thus, ribozymes (e.g., hammerhead ribozymes) can be ujjed to catalytically cleave 

squalene synthase mRNA transcripts to thereby inhibit translation of mRNA. A ribozyme 

having specificity for a squalene synthase-ehcoding nucleic acid molecule can be designed 

1 ? i 
based upon the polynucleotide sequence of the invention. For example, a derivative of a 

15 Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence of the 

active site is complementary to the nucleotide sequence to be cleaved in an encoding 

mRNA (US 4,987,071 and US 5,116,742.) Alternatively, squalen*? synthase mRNA can be 

used to select a catalytic RNA having a' specific ribonuclease actrvjity from a pool of RNA 

molecules.: * 5 \ 

■ ■ i i 

20 Application of antisense method to construct a carotenoid overproducing strain from P. 
rhodozyma was exemplified in EP 1,158,05L [ 

In one embodiment the present invention relates to a method of jmaking a recombinant 
host cell comprising introducing the vector or the polynucleotide of the present invention 
into a host cell. I \ 

25 Vector DNA can be introduced into prokaryotic or eulcaryotic cells via conventional trans- 
formation or transfection techniques.;* As ujsed herein, the terms | transformation" and 
"transfection", conjugation and transduction are intended to ref$r to a variety of art-re- 
cognized techniques for introducing foreign nucleic acid (e.g.i D^NA) into a host cell, 
including calcium phosphate or calcium chloride co-precipitation, DEAE : dextran- 

I * 5 I 

30 mediated transfection, lipofection, natural competence, chemical-mediated transfer, or 
electroporation. ' Suitable methods for transforming or t*ansfecting host cells including 
plant cells are known to the skilled pirson! 

■ ; .is 
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For stable transfection of mammalian . cells, it is known that, depending upon the expres- 
sion vector and transfection techniquejusedl only a small fractionjof ceils may integrate the 

foreign DNA into their genome. In order to? identify and select these integrants, a gene that 

j j i 
encodes a selectable marker (e.g., resistance^ antibiotics) is generally introduced into the 

5 host cells along with the gene of interest Preferred selectable markers include those which 
confer resistance to drugs, such as G418, hygromycin and methotrexate. Nucleic acid en- 
coding a selectable marker can be introduced into a host cell on the same vector as that en- 
coding the polypeptide of the present invention or can be introduced on a separate vector. 
Cells stably transfected with the introducedjnucleic acid can be identified by, for example, 
10 drug selection (e.g., cells that have incorporated the selectable marker gene will survive, 



while the other cells die). j \ 

] • J j 

To create a homologous recombinant microorganism, a vector isjprepared which contains 

at least a portion of the polynucleotide of the present invention into which a deletion, 

addition or substitution has been introduced to thereby alter, e.gi, functionally disrupt, the 

squalene synthase gene. Preferably, this squalene synthase gene is a P. rhodozyma squalene 

synthase gene, but it can be a homologue from a related or different source. Alternatively, 

the vector can be designed such that, upon homologous recombination, the endogenous 

I f I i 

squalene synthase gene is mutated or otherwise altered but still encodes functional protein 
(e.g., the upstream regulatory region can bi altered to thereby aitlr the expression of the 
20 endogenous squalene synthase). To ci eate(a point mutation via homologous recombina- 
tion also DNA-RNA hybrids can be used khown as chimeraplasty. 

I i 

The vector is introduced into a cell and cells in which the introduced polynucleotide gene 
has homologously recombined with ttiie endogenous squalene synthase gene are selected, 
using art-known techniques. < 5 

M 

25 Further host cells can be produced which contain selection systems which allow for regu- 

lated expression of the introduced gene. For example, inclusion (of the polynucleotide of 

I | S i 

• the invention on a vector placing it under control of the lac operpn permits expression of 

' * { 
the polynucleotide only in the presence of IPTG. Such regulatory systems are well known 

in the art. ( j 

' ' \ } 

30 . Preferably, the introduced nucleic acid molecule is foreign to the host cell. 

; I \ 

By "foreign 11 it is meant that the nucleic acid molecule is either heterologous with, respect 

to the host cell, this means derived from afcell or organism with ja different genomic back- 
ground, or is homologous with respefct to .the host cell but located in a different genomic 

\ } * 
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environment than the naturally occurring counterpart of said nucleic acid molecule. This 

J I f 
means that, if the nucleic acid molecule is homologous with respect to the host cell, it is 




not located in its Natural location in the genome of said host cell, in particular it is sur- 
rounded by different genes. In this casje the* nucleic acid molecule may be either under the 
control of its own promoter or under the control of a heterologous promoter. The vector 
or nucleic acid molecule according to the invention which is present in the host cell may 
either be integrated into the genome of the host cell or it may be maintained in some form 
extrachromosomally. In this respect, it is also to be understood that the nucleic acid mole- 
cule of the invention can be used to re$tore?or create a mutant gene via homologous re- 



10 combination. ! I 

Accordingly, in another embodiment the present invention relates to a host cell genetically 
engineered with the polynucleotide of [the invention or the vector; of the invention. 

! ; j i r 

The terms "host cell" and "recombinant host cell" are used interchangeably herein. It is 
understood that such terms refer not only to the particular subject cell but to the progeny . 
15 or potential progeny of such a cell. Because certain modifications may occur in succeeding 

generations due to either mutation or 'environmental influences, such progeny may not, in 

• S 5 ! 

fact, be identical to the parent cell, but are still included within the scope of the term as 

used herein. ! j 

I 

For example, a polynucleotide of the present invention can be introduced in bacterial cells 
20 as well as insect cells, fungal cells or nfammalian cells (such as CI inese hamster ovary cells 
(CHO) or COS cells), algae, ciliates, p|ant cells, fungi or other microorganims like R coK. 
Other suitable host cells are known toithose skilled in the art Preferred are E. coli 7 baculo- 
virus, Agrpbacterium or fungal cells are, for example, those of th(S genus Saccharomyces* 
e.g. those of the species S. cerevisiae oi P. rhodozyma (Xanthophylomyces dendrorhons). 



25 In addition, in one embodiment, the preseht invention relates to' a method for the produc- 

\ i 

tion of fungal transformants comprising the introduction of the polynucleotide or the vec- 

\ j a 

tor of the present invention into the genome of said fungal cell. { 
5 . ' * \ I i 

For the expression of the nucleic acid 'molecules according to the invention in sense or 

antisense orientation in plant cells, thfe molecules are placed under the control of regula- 

30 toiy elements which ensure the expression- in fungal cells. These' regulatory elements may 

be heterologous or homologous with xespect to the nucleic acid molecule to be expressed 

as well with respect to the fungal species to be transformed- | 

: . ! I . '! ■ 
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In general, such regulatory elements comprise a promoter active in fungal cells. To obtain 
constitutive expression in fungal cells, preferably constitutive promoters are used, such as 
the glyceraljdehyde-3-dehydrogenase pfomdter from P. rhodozymk (WO 97/23,633). In- 
ducible promoters may be used in order to be able to exactly control expression. An 

example for inducible promoters is the* promoter of genes encoding heat shock proteins. 

1 ■ i i jj 

Also an amylase gene promoter which is a candidate for such inducible promoters has 

been described (EP 1,035,206). The regulatory elements may further comprise transcrip- 
tional and/or translational enhancers functional in fungal cells, furthermore, the regula- 
tory elements may include transcription termination signals, sucli as a poly-A signal, 
which lead ! to the addition of a poly A tail to the transcript whichJmay improve its stability. 

Methods for the introduction of foreign D^IA into fungal cells are also well known in the 
art. These include, for example, transformation with LiCl method, the fusion of proto- 
plasts, electroporation, biolistic methods like particle bombardment other methods known 
in the art. Methods for the preparation of Appropriate vectors are known to the skilled 
artisan. Methods for the transformation using biolistic methods are well known to the 
person skilled in the art. j j | 



I 

The term "transformation" as used herein, refers to the transfer o*f an exogenous poly- 
nucleotide- into a host cell, irrespective of the method used for the transfer. The poly- 
nucleotide may be transiently or stably introduced into the host cell and may be main- 
tained non- integrated, for example, ak a plasmid or as chimeric links, or alternatively, may 

be integrated into the host genome. ( ( 

! I * i 

i ! 

In general; the fungi which can be modified according to the invention and which either 

' «* ! . { 

show overexpression of a protein according to the invention or a reduction of the synthesis 

of such a protein can be derived from|any kesired fungal species] 

II ' * j 

Further, in one embodiment, the present invention relates to a fungal cell comprising the 
polynucleotide the vector or obtainable btfthe method of the present invention. 



J 



Thus, the present invention relates also to transgenic fungal cellsj which contain (preferably 
stably integrated into the genome) a polynucleotide according to the invention linked to 
regulatory elements which allow expression of the polynucleotide in fungal cells and 
wherein the polynucleotide is foreignjto tibje transformed fungal tell- For the meaning of 
foreign see supra. I $ \ 

The presence* and expression of the polynucleotide in the transformed fungal cells modu- 
lates, preferably decreases the synthesis ofsquaiene and leads to the increase of the caro- 
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tenoids production, especially astaxanthin production in thus ob4ined transformed 
fungal cellsl preferably in P. rhodozyma cells. 

' ! ' I i I 

Thus, the present invention also relate? to transformed fungal cel|s according to the inven- 
tion. : ■ j [ | 

| j | 
5 Accordingly, due to the altered expression 6f squalene synthase, c<ells metabolic pathways 

are modulatedlin yield production, and/or efficiency of production. 

• • I I !' 

The terms "/production" or "productivity" are art-recognized and include the concentration 
of the fermentation product (for example fatty acids, carotenoids^ (poly) saccharides, 
vitamins, isoprenoids, lipids, wax estejs, anjd/or polymers like pojyhydroxyalkanoates 
10 and/or its metabolism products or further desired fine chemical ds mentioned herein) 

formed within a given time and a given fermentation volume (e.g\, kg product/hour/liter). 

; : ' i i j 

The term "efficiency" of production includes the time required for a particular level of pro- 
duction to be achieved (for example, How long it takes for the cell to attain a particular rate 
of output of ajsaid altered yield, in particular, into carotenoids, (poly)saccharides, lipids, 
15 vitamins, isoprenoids, etc.). j 



j » 



The term "yield" or "product/carbon jjidd'fc art-recognized andlincludes the efficiency of 
the conversion of the carbon source into uje product (i.e. acetyl fcoA, fktty acids, caroten- 
oids, vitamins', isoprenoids, lipids etcjand/pr further compounds as defined above and 
which biosynthesis is based on said products). This is generally Witten as, for example, kg 
product per kg carbon source. By increasing the yield or production of the compound, the 
quantity of recovered molecules, or of useful recovered molecules of that compound in a 
given amount of culture over a given junotmt of time is increased. 

( i « 
The terms "biosynthesis" (which is used synonymously for "synthesis" of "biological pro- 
duction" in cells, tissues plants, etc.) 6r a , >iosynthetic pathway'j are art-recognized and in- 
clude the synthesis of a compound, preferably an organic compound, by a cell from inter- 
mediate compounds in what may be a multistep and highly regulated process. 

. ■ I • • ! j | 

The language "metabolism" is art-recpgni|ed and includes the totality of the biochemical 
reactions that take place in an organism. Jrhe metabolism of a particular compound, then, 
(e.g., the metabolism of acetyl CoA, a fatty acid, hexose, lipid, ispprenoid, vitamin, caro- 
tenoid etc.) comprises the overall bi Jsynthetie, modification, and degradation pathways in 
the cell related to this compound. 
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Such a genetically engineered P. rhodozyma would be cultivated irj an appropriate medium 
and evaluated in its productivity or/and yield of carotenoids, especially astaxanthin. A ' 
hyper producer of astaxanthin thus selected! would be confirmed fn view of the 
relationship between its productivity afid the level of gene or protein expression which is 

introduced by such a genetic engineering method. 

I i i 

1 i 1 ] 

The present invention is further illustrated with Examples described below. 

If ! 

The following materials and methods were employed in the Example described below: 

| • j \ 
Strains ! . t J \ 

P. rhodozyma ATCC96594 (re-deposited un*der the accession No JaTCC 74438 on April 8, 



10 1998 pursuant to the Budapest Treaty); ! 

i s 



R coli DH5a: F> <j>80d, ZacZAMlS, A(^cZYAwirgF)Ul69, hsd (r K " 
deoR, tW-1, 5UpE44, gyrA96, relAl (Toyoboi Osaka, Japan) j 



niK + ), recA\> endAU 



E. coliXLl "MRA (P2): A(mcrA)l83, A(Wr<SB-WSMR-mrr)173, kndAl, supE44, 
gyrA96> relAl, lac (P2 lysogen) (Stratagene,|La Jola, U.S.A.) J 



15 Vectors ! 
XDASHII (Stratagene) I 
pBluescriptU KS- (Stratagene) j j 

pMOSBlueT- vector (Amersham, Buckinghamshire, U.K.) 

I i i 

: I 5 

Media , : I 

, : | s j 

20 P. rhodozyma strain was maintained routinfely in YPD medium (DIFCO, Detroit, U.S.A.). 

R coli strain was maintained in LB medium (10 g Bacto-tryptonjs g yeast extract (DIFCO) 

( i f 
and 5 g NaCl per liter). NZY medium (5 gdNaCl, 2 g MgSO^THjO, 5 g yeast extract 

(DIFCO), 10 g NZ amine type A (WAKO, Dsalca, lapan) per liter) is used for X phage 
propagation in a soft agar (0.7 % agarj(WAKO)). When an agarjjmedium was prepared, 
25 1.5 % of agar (WAKO) was supplemented^ j 

; : IS- * " 

Methods ! ■ ■ i j j 

Restriction enzymes and T4 DNA iigase were purchased from Takara Shuzo (Ohtsu, 



Japan). ; j { 

Isolation of a chromosomal DNA from P. rhodozyma was performed by using QlAGEN 
30 Genomic Kit (QlAGEN, Hilden, Germany) following the protocol supplied by the manu- 
facturer. Mini-piep of plasmid DNAifrom transformed £ coli was performed with the . 
Automatic DNA isolation system (PL$50, Kurabo, Co. Ltd., Osaka, Japan). Midi-prep of . 

! • i\ I 
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plasmid DNA from an E. colt transformant was performed by using QIAGEN column 
(QIAGEN), Isolation of X DNA was performed by Wizard lambda preps DNA 
purification system (Promega, Madisop, U.SA.) following the protocol prepared by the 
manufacturer. .A DNA fragment was isolated and purified from agarose by using 

5 QIAquick or QIAEX IT (QIAGEN). Manipulation of X phage derivatives was followed by 
the protocol prepared by the manufacturer ;(Stratagene). \ 
Isolation of total RNA from P. rhodozyma was performed with phenol method by using 
Isogen (Nippon Gene* Toyama, Japan). mRNA was purified from total RNA thus ob- 
tained by using mRNA separation kit (Clontech). cDNA was synthesized by using 
1 o CapFinder :cDNA construction kit (Clonteqh) . | 

In vitro packaging was performed by ujsing Gigapack III gold packaging extract (Strata- 
gene), t ft j 

Polymerase chain reaction (PGR) is performed with the thermal cycler from Perkin Elmer 

model 2400. Each PGR condition is described in examples. PCR^primers were purchased 
1 its 
15 from a commercial supplier. Fluorescent DNA primers for DNAj sequencing were pur- 
chased from Pharmacia. DNA sequencing was performed with the automated fluorescent \ 
DNA sequencer (ALFred, Pharmacia).; ( 

j jr 

Competent cells of DH5a were purchased fromToyobo (Japan) J 

i \ 
i I 

Example 1; . Isolation of mRNA from P. rhodozyma and construction of a cDNA library 

; •• j| i 

20 To construct a cDNA library of P. rhodozyrna, total RNA was iso|ated by phenol extraction 

method right after the cell disruption and the mRNA from P. rhodozyma ATCC96594 

strain was purified by using mRNA separation kit (Clontech). \ 

Cells of strain ATCC96594 from 10 ml of a two- day-culture in YPD medium were 

1 \ ! 
harvested by centrifugation (1500 x gfor 10 min.) and washed once with extraction buffer 

25 (10 mM Na-citrate / HC1 (pH 6.2) containing 0.7 M KC1). After suspending in 2.5 ml of 

extraction buffer, the cells were disrupted by French press homqgenizer (Ohtake Works 

Corp., Tokyo, Japan), at 1500 kgf/cm^ n d immediately mixed wjth two times of volume of 

! ? \ 

isogen (Nippon gene) according to the method specified by the manufacturer. In this step, 
400 \lg of 'total RNA was recovered, j { j 
30 Then, this total RNA was purified by using mRNA separation kit (Clontech) according to 
the method specified by the manufacturer* Finally, 16 jig of mlJlSfA from P rhodozyma 
ATCC96594 strain was obtained. | | . j 

To constructcDNA library, CapFinder PGR cDNA construction kit (Clontech) was used 
according to the method specified by the manufacturer. One jig of purified mRNA was 

» 4 
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applied for a first strand synthesis followed by PCR amplification, 
by PCR, 1 ihg of cDNA pool was obtained. 1 



After this amplification 



Example 2:[ Cloning of a partial SQSi (squalene synthase) gene from P. rhodozyma 

Ml 

odozyma, 



To clone a partial SQS gene from P. rhodo 



a degenerate PCR method was exploited. 



Species and accession number to database whose sequence for squalene synthase were used 



for multiple alignment analysis are as follows. 
Ustilago maydis Q92459 (SwissProt) 

Schizosaccharomyces pombe P36596 (SwissProt) 

♦ * \ 

Saccharomyces cerevisiae M63979 (GenBank) 

Rattus noTvegictis 



Mus musculiis : 
Candida albicans 
Homo sapiens ■ 
Arabidopsis thaliana 
Leishmania major 
Glycyrrh iza glabra 



Q02769 (SWissProt) 
P53798 (SwissProt) 
P78589 (Siviss?Tot) 
138245 (Pur) j 



U79159,AF004396 
U30455 \ ' 
D86410 [ j 

\ 



Two mixed primers whose nucleotide Ssequfences were designed and synthesized based on 
the common sequence of known squalene synthase genes from other species, i.e. squl 
(sense primer) (SEQ ID NO:4), squ4 (antisjense primer) (SEQ IdJ NO:5) and squ5 (anti- 
sense primer) (SEQ ID NO:6) (in the Sequences V means nucleotides a, c, g or t, Y 
means nucleotides a or g, and "y" means nucleotides c or t). 



After the ^CR reaction of 25 cycles ofp5°Cj for 30 seconds, 45°C for 30 seconds and 72°C 
for 15 seconds by using ext (Takara Shuzo) as a DNA polymerase and cDNA pool ob- 
tained in example 1 as a template, thejreaction mixture was applied to agarose gel electro- 
phoresis. Each PCR band that had a desired length was recoverejd from the PCR reaction 
mixture in which the combination using squl and squ4> and sqril and squ5, respectively 



and purified by QlAquick (QIAGEN)- according to the method tjy the manufacturer and 
then ligated to pMOSBlue-T-vector (Xmersham). After transformation of competent E. 

I ■ ? 5 5 

coli DH50, 6 white colonies were selected £nd plasmids were isolated with Automatic DNA 

! J ! 
30 isolation system. As a result of sequencing, it was found that 3 clones had a sequence 

whose deduced amino acid sequence jwas Similar to known squalene synthase genes. These 

isolated cDNA clones were designated as pSQS1007 derived from the PCR reaction using 

squl and squ5 and as pSQS1006 derived from the PCR reactionjusing squl and squ4, and 

pSQS1006 was used for further screening study. 

» i 

:• \ \ 
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Example 3: Isolation of genomic DNA from P» rhodozyma \ 

\ i ; j { 

To isolate a genomic DNA from P. rhodozyma* QIAGEN genomic kit was used according 

i I \ 
to the method specified by the manufarturer, j 

CeDs of P. rhodozyma ATCC96594 from 100 ml of overnight culture in YPD medium were 
5 harvested by centrifugation (1500 x g for id min.) and washed onfce with TE buffer ( 10 
mM Tris / HC1 (pH 8.0) containing 1 mM EDTA). After suspending in 8 ml of Yl buffer 
of the QIAGEN genomic kit, lyticase (SIGMA, St. Louis, U,S,A.) was added at the concen- 
tration of 2 mg/ml to disrupt cells by enzymatic degradation and the reaction mixture was 
incubated for 90 minutes at 30°C and then proceeded to the next [extraction step. Finally, 
10 20 Jig of genomic DNA was obtained. | j } 



Example 4: 



Southern blot hybridizitioreby using pSQS1006 aS a probe 



Southern blot hybridization was performed to clone a genomic fragment which contains 
SQS gene from P. rhodozyma. Two p,g of genomic DNA was digested by EcoRI and sub- 
jected to agarose gel electrophoresis followed by acidic and alkalipe treatment The de- 
is natured DNA was transferred to nylon membrane (Hybond N+, jAmersham) by using 
transblot (Joto Rika, Tokyo, Japan) for an Jiour. The DNA which was transferred to nylon 
membrane was fixed by a heat treatment ($0°C, 90 min). A probe was prepared by 
labeling a template DNA (EcoRI-digested pSQSl006) with DIG j|iultipriming method 
(Boehringer Mannheim). Hybridization was performed with th4 method specified by the 

1 i i I 

20 manufacturer! As a result, a hybridized bahd was visualized in the range from 9.0 to 23.0 
kilobases (kb). j j 

Example 5: . Cloning of a genomic fragnjent containing SQS gene 

: . i | | 

Four \i% of the genomic DNA was digested by BcoRI and subjected to agarose gel electro- 

: 3 ' J 
phoresis. Then, DNAs whose length is within the range from 9.0 to 20.0 kb was recovered 

25 by QI AEX II gel extraction kit (QIAGEN) according to the method specified by the manu- 

; i . t j | 

facturer. The purified DNA was ligatfed to;0.5 fxg of EcoRI-digesfed and CIAP (calf 

J I | 
intestine alkaline phosphatase)-treated XDASH II (Stratagene) at 16°C overnight, and 

packaged by Gigapack III gold packaging extract (Stratagene). ^he packaged extract was 

infected to E. coU MRA(P2) strain anjl ovcfr-laid with NZY medium poured onto LB agar 

30 medium. About 5000 plaques were screened by using EcoRl-digested pSQS1006 as a 

probe. Five plaques were hybridized ,to the labeled probe. \ 

1 ■ \ \ i . 

! ! 
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This A,DASH II derivative containing putative SQS gene from P. rhodozyma was prepared 

1 ' I I 

by using Wizard lambda preps DNA purification system (Promega). Next, PCR was con- 

f \ f 
ducted by using these XDASH II derivatives as a template and two primers, squ9 and squlO 

i * i j 

as primers.; These squ 9 and squlO prirners;were designed based on the internal sequence 

5 of pSQ$1006: squ9 (sense primer) (SEQ ID;NO:7) and squlO (anfisense primer) (SEQ ID 

NO:8). \ \ j 

As a result |of PCR under the same PQt condition as described injExample 2, an expected 
0.5 kb band was yielded. It was suggested that all of those XDASH II derivatives might 
contain a putative SQS gene from P. rhodozyma. Approximately 20.0 kb EcoRI insert 
10 fragment in orie of these XDASH II derivatives was purified by using QlAEX II (QIAGEN) 
and subjected to subcloning into pBIuescriptll KS- vector (Stratajgene) by using DH5a as a 
host strain and yielded pSQl229. \ I j 

I Is t 

Example 6': Sequencing of a genorrwc fragment containing SQS gene 

* i \ 

pSQS1229* was sequenced with primed walking procedure by using AutoRead sequencing 

15 kit (Pharmacia). \ \ j 

As a result of sequencing, a nucleotide sequence comprising 480^ base pairs of a genomic 

fragment containing SQS gene from i>. rhodozyma containing itsjpromoter (1549 bp) and 

terminator (836 bp) was determined (SEQjlID NO:l). j 

The coding region was 2422 base pairs long consisting of 9 exons and 8 introns. Introns 
20 . were dispersed all through the codingjregion without 5' or 3' bias. It was found that an 
open reading frame (SEQ ID NO:2) consists of 512 amino acids (SEQ ID NO;3) whose 
sequence is strikingly similar to the known' amino acid sequencefof squalene synthase from 
other species (51.3% identity to squalene synthase from SchizosJccharomyces pombe) as a 
result of homology search by GENET^X-SV/RC software (Software Development Co., 

25 Ltd,, Tokyo, Japan). ] I 

I • | j 

Example 7: Construction of antisense glasmid for SQS gene 

; ■ ! \ 

An antisense gene fragment which covers the entire structure gene for SQS gene is ampli- 
fied by PCR method and then donedjinto {integration vector in tohich antisense SQS gene 



is transcribed by its own SQS promoter in* P. rhodozyma. 



30 Such primers include asymmetrical recognition sequence for restriction enzyme, Sfit 
(GGCCNNNNNGGCC) but their asymmetrical hang-over sequence is designed to be 
different. This enables a directional cloning into expression vector which has the same 



i t 

i i 



20 



30 
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asymmetrical sequence at their ligation; sequence- The usage of such a construction i$ 



exemplifiedin EP 1,158,051. 



\ 



For the promoter and terminator fragment Which can drive the transcription of the anti- 

• f > 5 J 

sense SQS gene, SQS promoter and terminator is cloned from the chromosome by using 
the sequence information listed in SEQ ID ho:L 

Next, SQS terminator fragment is fused to G418 resistant cassettejby ligating the DNA 
fragment containing SQS terminator to G418 resistant cassette of:pG418Sa330 (EP 
1,035,206) to appropriate vector such as pBluescriptll KS- (Strat^gene). 



Then, 3.1 kb of Sad fragment containing ribosomal DNA (rDNA|) locus (Wery et aL, 
10 Gene, 184, ;89-97, 1997) is inserted in the downstream of G418 cassette on thus prepared 
plasmid. rDNA fragment exists in multicopies on the chromosome of eukaryote,. The 
integration event via the rDNA fragment wpuld result in multicopied integration onto the 

chromosome of the host used and this; enables the overexpressioi of foreign genes which 

i i • 

are harbored in expression vector* \ | 

15 Subsequently, SQS promoter is inserted in the upstream of SQS terminator to construct of 

■ 8 i 
expression vector which functions in P. rhodozyma. I 

Finally, antisense SQS construct is completed by inserting the L5kb of SfH fragment con- 
l into thus prepaf ed expression vector functioning in P. rhodozyma. 



taining antisense SQS into thus prepared expr< 



A similar plasmid construction is exemplified in EP 1,158,051. 

■ • | | 

Example 8: Transformation of P. rhodozyma with SQS-antisehse vector 

i i i 

The SQS-antisense vector thus prepared isftransformed into P. rJtodozyma wild type strain, 

ATCC96594 by biolistic transfonnatiim following the protocol described in EP 1,158,051. 

1 : i 1 

l * j 
Example 9: Characterization of antisense SQS recombinant of P. rhodozyma 

, . i I f 

Antisense SQS recombinant of P. rhodozyma, ATCC96594 is cultured in 50 ml of YPD 
25 medium in 500 ml Erlenmeyer flask ajt 20°C for 3 days by using their seed culture which 
grows in 10 ml of YPD medium in test tubes (21 mm in diameter) at 20°C for 3 days- For 
analysis of carotenoid produced appropriate volume of culture $roth is withdrawn and 
used for analysis of their growth, productivity of carotenoids, especially astaxanthin. 
For analysis of growth, optical density at ^60 nm is measured b)| using UV-1200 photo- 
meter (Shimadzu Corp., Kyoto, Japap) injaddition to the deterrjiination of their dried cell 
mass by drying up the cells derived from j ml of broth after microcentrifugation at 100°C 
for one day, * j 



p 

t 

j 



20 = 6 



I 

i 



For analysis of content of astaxanthin and total carotenoids, cells are harvested from 1,0 ml 
of broth after microcentrifugation andjusedifor the extraction of the carotenoids from cells 
of P. rhodozyma by disruption with glass beads. After extraction, disrupted cells are re- 

is I 

moved by centrifugation and the resultant is analyzed for carotenbid content with HPLC. 
5 The HPLC condition used is as follows^ HPLC column; Chrompajbk Lichrosorb si-60 (4.6 
mm, 250 mm); Temperature; room terhper^ture; Eluent; acetone >/ hexane (18/82) add 1 

ml/L of water to eluent; Injection volume; 10 |jl; Flow rate; 2.0 ml/minute: Detection; UV 

I I j 
at450nm. : i l 

Reference samples can be obtained frojn Hoffmann La-Roche (Basel, Switzerland) (parti- 
10 cularly astaxanthin) or other commercial suppliers (WAKO, SIGMA, etc.) 

: i c 
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: ; i 

1. An isolated polynucleotide comprising a nucleic acid molecule tone or more selected 



I 



from the group consisting of j 

(a) nucleic acid molecules encoding atleastjthe mature form of the polypeptide depicted in 
5 SEQIDNO:3; \ } j 

f t ' 

(b) nucleic acid molecules comprising'the coding sequence as depicted in SEQ ID NO:2; 

i { \ : 

(c) nucleic acid molecules whose nucleotide sequence is degenerate as a result of the 

* \ [ 
genetic code to a nucleotide sequence pf (a) or (b); 

(d) nucleic acid molecules encoding a polypeptide derived from the polypeptide encoded 

• ! \ ■ \ 

10 by a polynucleotide of (a) to (c) by way of substitution, deletion and/or addition of one or 

• i p 

several amino acids of the amino acid sequence of the polypeptide encoded by a nucleotide 
of (a) to (c); { j j 

(e) nucleic; acid molecules encoding a polypeptide derived from die polypeptide whose 
sequence has an identity of 51.3 % or more to the amino acid sequence of the polypeptide 

15 encoded by a nucleic acid molecule of (a) or (b); j 

(f) nucleic' acid molecules comprising'.a fragment encoded by a nucleic acid molecule of 

i * f 

any one of (a)' to (e) and having squalene synthase activity; \ 

(g) nucleic acid molecules comprising a polynucleotide having ajsequence of a nucleic acid 
molecule amplified from a Phaffia nucleic acid library using the primers depicted in SEQ 

20 ID NO:4, 5, and 6; j 5 



L 



(h) nucleic add molecules encoding a - polypeptide having squalene synthase activity, 
wherein said polypeptide is a fragment of a polypeptide encoded by any one of (a) to (g); 

(i) nucleic acid molecules comprising-, at lekst 15 nucleotides of ajpolynucleotide of any one 
of (a) to (d);. j 1 | 

(j) nucleic acid molecules encoding a polypeptide having squalene synthase activity, 

wherein said polypeptide is recognized by 'antibodies that have b*een raised against a 

polypeptide encoded by a nudeic acid molecule of any one of (a} to (h); 

(k) nucleic acid molecules obtainable'by screening an appropriate library under stringent 

conditions with a probe having the sequence of the nucleic acid molecule of any one of (a) 

to (j), and encoding a polypeptide haying jsqualene synthase actiWy; 

(1) nucleic acid molecules whose complementary strand hybridizes under stringent 

conditions with a nucleic add molecule of any one of (a) to (k) jand encoding a 

• \ i t I 
polypeptide having squalene synthase activity. s 

• * * \ 

• ' j " • 5 

2. An isolated polynucleotide comprising a nucleic acid molecule one or more selected 



35 from the group consisting of. 



S 



! 



f 
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: ! i 

(m) nucleic acid molecules comprising the nucleotide sequence as depicted in SEQ ID 

NO:l; j . I !' 

(n) nucleic acid molecules whose nucleotide sequence is degenerate as a result of the 
genetic code to a nucleotide sequence <>f (m[); j 
5 (o) nucleic acid molecules encoding a polypeptide derived from tjie polypeptide encoded 
by a polynucleotide of (m) or (n) by wpy ofsubstitution, deletion! and/or addition of one 
or several amino acids of the amino acid sequence of the polypeptide encoded by a 
nucleotide of (m) or (n); II j 

(p) nucleic acid molecules encoding a polypeptide derived from the polypeptide whose 
10 sequence has an identity of 51.3 % or morelto the amino acid sequence of the polypeptide 
encoded by a nucleic acid molecule of (m); j t 
(q) nucleic acid molecules comprising'a fragment encoded by a nucleic acid molecule of 
any one of:(m) to (p) and having squajene Synthase activity; \ 

(r) nucleic acid molecules comprising a polynucleotide having a sequence of a nucleic acid 

15 molecule amplified from a Phaffia nucleic acid library using the primers depicted in SEQ 

ID 1*0:4,5, and 6; j j j 

(s) nucleic acid molecules encoding a polypeptide having squalerie synthase activity, 

wherein said polypeptide is a fragment of alpolypeptide encoded Ly any one of (m) to (r); 

if \ 
(t) nucleicjacid molecules comprising *at least 15 nucleotides of a polynucleotide of any one 

20 of(m)to(o); ; ? \ 

* i I 
(u) nucleic acid molecules encoding afpolypeptide having squalejie synthase activity, 

wherein said polypeptide is recognized by antibodies that have been raised against a 
polypeptide encoded by a nucleic acid| molecule of any one of (rn) to (s); 
(v) nucleic acid molecules obtainable by screening an appropriate library under stringent 
25 conditions with a probe having the sequence of the nucleic acid rholecule of any one of 
(m) to (xx)[ and encoding a polypeptide having squalene synthasf activity; 
(w) nucleic acid molecules whose complementary strand hybridises under stringent 
conditions with a nucleic acid molecule of?any one of (m) to (v)jand encoding a 
polypeptide having squalene synthase) activity. 

j j 

30 3- The isolated polynucleotide of claim 1 or 2, wherein said polynucleotide encodes an 



amino acid sequence which is identified by SEQ ID NO:3 or has [an identity of 513 % or 
morewithSEQIDNO:3. j j j 

4. The isolated polynucleotide of any one of claims 1 to 3, wherein said polynucleotide is 
derived from a strain of Phaffia rhodozyma or Xanthophylomyces dendrorhous. 
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5. A method for maldng a recombinant vector comprising inserting the polynucleotide of 
any one of claims 1 to 4 into a vector, j \ 

: i j | 

6. A recombinant vector containing thfe polynucleotide of any one of claims 1 to 4 or 
produced by the method of claim 5. j * 

: ? 

5 7. The vectpr of claim 6 in which the polynucleotide of any one of claims I to 4 is 

i 1 f i 

operatively linked to expression control sequences allowing expression in prokaryotic or 
eukaryotic cells. J | 

8. A method of making a recombinant; orgahism comprising intrpducing the vector of 

claim 6 or 7 into a host organism. » ! 

; J ? 
; \ t 
10 9. The method of claim 8, wherein said host organism is selected jfrom R colu baculovirus, 

or S. cerevi$iae> \ j \ 

j j I 

10. The recombinant organism containing the vector of claim 6 c>r 7, or produced by the 
method of claim 8 or 9. \ * \ 

i i i 

1 1. A process for producing a polypeptide having squalene synthase activity comprising 
15 culturing the recombinant organism df claim 10 and recovering ihe polypeptide from the 

' . ! i S 

culture of said recombinant organism* ; 5 

j i ! 

12. A polypeptide obtainable by the process of claim 11. j 

M i 

13. An antibody that binds specifically to the polypeptide of claim 12. 

! ! 1 • ' 

14. An antisense polynucleotide against the polynucleotide of any one of claims 1 to 4. 



5 

i t 

20 15. A method/or making a recombinant vector comprising inserting the polynucleotide of 

' • i \ ^ 

claim 14 into avector. I i l 

; : ; j j 

16. A recombinant vector containing the polynucleotide of claim 14 or produced by the 
method of claim 15. ! [ | 

' 5 f 

I : j 

17. The vector of claim 16 in which the polynucleotide of claim 14 is operatively linked to 

25 expression control sequences allowing expression in prokaryotic or eukaryotic cells. 

: : I | j 

18. A method of making a recombinant organism comprising introducing the vector of . 
claim 16 or 17 into a host organism, j | j 

: | I I 



t 

\ i 

: I 
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19. The method of claim 18, wherein said host organism belongs io a strain of Phaffia 



rhodozyma or Xanthophylomyces dendrorhous. 

■ ! i 

20. The recombinant organism containing the vector of claim 16 

j < 

method of claim 18 or 19. ! \ 



ur 17, or produced by the 



21. The recombinant organism of claim 20,?wherein said organisin is characterized in that 
whose gene expression of squalene synjthase^ is reduced compared] to the host organism, 
thereby is capable of producing carotehoids in an enhanced level relative to a host 



organism. 



22. The recombinant organism according t6 claim 21, wherein the gene expression of 
squalene synthase is reduced by means of the technique selected from antisense 
technology, site-directed mutagenesis,? error prone PGR, or chemical mutagenesis. 

j j | 

23. A process for producing carqtenoiSs, which comprises cultivating the recombinant 
organism of claim 21. ? 

24. The process of claim 23, wherein said c^rotenoids are selected one or more from asta 



xanthin, {^carotene, lycopene, zeaxanthin jcanthaxanthin. 



25. The process according to claim 23j whelcein the gene expression of squalene synthase is 
reduced in the recombinant organism* of datm 21 by means of the technique selected from 
antisense technology, site-directed mutagenesis, error prone PGR, or chemical 
mutagenesis. : { 



26. A process for the production of a carotenoid by culturing a microorganism under suit- 
able conditions and, optionally, recovering the resulting carotenoid, wherein the micro- 
organism is characterized in that its gene expression of squalene Synthase is reduced, e,g. 
by means of the technique selected from antisense technology, site-directed mutagenesis, 
error prone PfcR, or chemical mutagenesis^ f 
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<110> 


Roche Vitamins AG 




<120> 


SQS gene 




<130> 


NDR52Xo 


5 


<160> 


8 




<170> 


Pacentln version 3 




<210> 


1 * 




<211> 


4807 




<212> 


DNA 


10 


<213> 


Phaff ia rhoaozyma 




<220> 






<221> 


exon 




<222> 


* mm mm tm • / -* f~~ ^ r| % 

(1550) . . (1577) 

: 
I 




<223> 


15 


<220> 


! 




<221> 


polyA_site 




<222> . 


(4106) . . (4107) 




<223> 






<:220> 




20 


<221> 


5'UTR 




<222> 


(1469) . - (1470) 
• 




<223> 


i 




<220> 


i 
. 




<221> 


exon 


25 


<222> 


(3959) . . (3970) 




<223> 






<220> 






<221> 


intron 




<222> 


(3882) - . (3958) 


30 


<223> 


! 

1 

I 




<220> 






<221> 


exon' 




<222> 


(3567) . • (3881) 




<223> 


i , 


35 


<22 0> 


■ 




*221> 


Intron 




<222> 


(3476) . . (3564) 
• 




<223> 


i 




<220> 


• 


40 


<221> 


exon 




<222> 


(3454) . , (3474) 




<223> 






<220> 






<221> 


Intron 



<222> 
<223> 
<220> 

5 <222> 
<223> 
<220> 
<221> 
<222> 

10 <223> 
<220> 
<221> 
<222> 
<223> 

15 <220> 
<:221> 
<222> 
<223> 
<220> 

20 <221> 
<222> 
<223> 
<220> 
<221> 

25 <222> 
<223> 
<220> 
<221> 
<222> 

30 <223> 
<220> 
<221> 
<222> 
<223> 

35 <220> 
<221> 
*222> 
<223> 
<220> 

40 <221> 
<222> 
<223> 



(3357) . . (3453) 



exon 
(3231) 



(3356) 



Intron 

(3088) - . (3230) 



i 

exon 

(2476) . • (3087) 



Intron 
(2398) . 

i i 
eion* 
(2183) . 



Intron 
(2072) . 



exon* 



(2474) 



(2395) 



(2182) 



(i883) . . (2071) 



Intron 
(1767) . 

! . . 
i ! 

exon' 
(1755) , 



Intron 
(.1578) . 



. (1882) 



. (1766) 



<1752) 



<400> i 



\40 



1 \ 
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gttccngttc 

I 

ggtcagaaca 

* 

5 gtgaggtttg 

i 

gatgaacgac 
ttccagcttt 
agtcgatgtt 
ggaggacagg 
15 tcgaaggtcg 

atttcctfctt 

i 

cfcgaccgatc 



aggttcatgt 



agcaggctca 



25 gtcgagattc 



gcgggggate 



tcataggtga 



cgacttgggc 

tggcgggttc 
i 

35 .aatgacagtt 
actgaagaag 

cttgatcaga 
40 ctgctctttc 
■ 

fcgtgctfogcc 

i 

i 
i 

cagagacgac 



agtcaaagag 

tcggagatac 
i 

cctaggaagt 
acgtcgaacc 
tctgctctct: 
cfcg tcgacag 
gtggctttaa 
acfccctcttg 
ctttctaccc 
cgaatatcta 
ttgaaagcat 
atgatcactt 
ntttttcttc 

acccgcatat 

t 

aggttaaaac 

agcctcgtcc 

gtcatggccg 

jtcecgacgcc 

gtagggtctc 

cccgacttat 
gtgtttttca 



Mi- 



tgggaaaaac acgaaagtaa aaagatgtaa tgaaagaagg 

i I 

: i 

aatggcccat agaggaagga aagctactta ccagaaacca 



aatcccttcg ttitctcaaag atatctttnt tgaaagcatc 



I 



f 



catctccatfc ctfcgaaatca agtttacfcpg atctagacct 

i j* 

t t 

ccagtttcgc agctttctct tcgggaagaa gctctccgcc 

i i 

gagaccagta gaaggcggaa ccgacaattt tggatggatc 

it ! 

caaatcggca gtacggagga tcgaacggcg cttctctcgt 



ccatgtgtat gagagcatat ccgttgatgt cecagttaaa 



ggagagtaag acacacaaag aatcacgaag aatatgatga 

i a 

i i 

gcgcaggttg ctitctctact ggttccat.tc ttcgaacgat 

it i 

cgatcctagt tgcctctatc fcgaggccagt ctgccaatgt 



i 



.eft 



gg&fifttcgttg catctcgatg ttcaaccaag tgtcgcaacg 



ttttggtcgra gaaaaaaaaa cggcttcgct tcgcacgcgc 



i 



L 



taagcggtat gacgctcatc aaccggccaa gtgttctcca 



i 



i 



ggaatggata g^aggagcta accacgtttt tatcttaatt 



atagtgtctg atggttacat cgtcatagaa aggcagcgcc 

if / 

I I i 

tgatcatctg ctttgttaga cattgtccat cagtcacctc 



1 



afccactaaga cacaaacgta tccagcaqgc catgtccatc 



gtcgagccag tgcaaccaga gfctacagafcg aacatcaggc 



gaatatggec g;tfcatfcgtac acttcttggt gctcctcgag 



ctfctcttccc gfgatcaaacg agactgcfccg tgtacccatc 



atatgagcat cccatgccftc tgctcaaatg ^tgctggage tacgatccat 

i 5 

acaaaacggg gttgtatg;aa ctctacattt 



cctaatgtta ttggaatttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 
1260 

1320 

1380 



10 



15 



20 
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ctgtaatgcg ttcttcatct ttctctaatg ct'tttttgta gtccgncctt tcaacctcgc 

! I • 

cagcgttccg cgtgtcttct ttctcctct'g acggtcatca ctttcttctc tcttctcgct 

j j. 

ctttcttcc'g tccttccttc tctctcttcg tc^tgaacatc agcatcat£ atg ggc ata 

• ■ j Met Gly He 

• • . i 

j 

tea gac tac ;Ctc gtt ctg g gtcagttetg tcttttgttt gattlttatc 
Ser Asp Tyr Leu Val Leu j 
5 



i 



ttcttgeegg cggtcgcctg tcttgggtat atcatcagca atgagaaaca tgatgttccc 



1 



cccgcgtcaa tcactgacct tttggtccdc tacttctttc ctgtcgaafet gatcctgatt 

] ! 



gatacgtgtg ceggetgett aacagct tit pac gca tec t 

Val His Ala ser 

5 



gtagg 

f 

s 

\ 



tgttt 



tategtatge ttcatgttga tgtttagtca cgcggactga cctggccg/gt tgattttctg 



: i ? 
tatgatcget tgtgctaccg tctttcttgg aaatccttcc catcag g*c cga tct 



Cys Arg Ser 

i is 



20 



1 2S i 

i 



r 

Ala Ser $Lys <3lu Tyr 
I 30 • 



cac tgc aca gga gga aca tgc aac ate egg ttg gga ccg 
His Cys Thr Gly Gly Thr Cys Ash. lie Arg Leu Gly Pro 



35 



gaa gga atg ttg gaa gta ttt 



| 40| 



gga 



5 



1440 
1500 
15S8 



aga aac tat 
Arg As ii Tyr 
45 



tot gac ttc aag aag^ttt cgc age 
Glu Gly Met Leu Glu Val Phe Gly Ser Asp Phe Lys Lya?Phe Arg Ser 

50 S5J j 60 j 

tgt cat caa aga gtt gga egg agatct tac ccg agt c geacgtgttt 



Cys His Gin Arg Val Gly Arg Arg Se'r Tyr Pro Ser 



65 
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tcatcttctc nctcctttga gatctggtcg cjctccgcatt ttcttgttgc agaagggeca 
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1667 
1727 
1776 



1836 
1890 



gcg age ttt'aat gea gta cgc ga£ ctg gca tga gec teg faag gaa tat 193 8 

Ala Ser Phe Asn Ala val Arg Asp Leu Ala 
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gaagctgaca acaccatccc tactgctcgg ga[cacggcta g at ct£ ttt att eta. 

! { His Leu Phe lie* Leu 

\ \ 2 80 
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cgc tct teg agg act gga tac! cat tga gga tga cat gag tct ate 



Ser Arg Ser Ser Arg Thr Gly Tyr; His 



85 



Gly 
90 



His Glu Ser lie 
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taa tga tgt gaa get tec cct gctj teg gac att ctg gga aaa get tga 



Cya Glu Ala Ser Pro Ala; Set Asp lie Leu Gly pys Ala 
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raa gga tag 
ilu Gly 



•120 



etc ccc tgg »gtg gac cct tac tgg atp egg tec aaa tga 
Leu Pro Trp val Asp Leu Tyr Trf> He Arg ser Lys 
110 115 j 
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aga get tct tgt tea ctt cga tgt ggc cat cgc cga gtt [tgc caa ctt 
Arg Ala Ser Cys Ser Leu Arg Cys Gly His Arg Arg Val pys Gin Leu 
125 \ 130 *L35 



gga cgt c aagtgagttt ccctttatgg ttggatcatc cgctcgaca'g actcgaaacg 
Gly Arg : \ \ I 
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ctcatcactt tggtctgett gatgaacagc tc teg gaa cgt cat? teg aga cat 2458 

| Leu Ser Glu Arg His Ser Arg Hia 
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cac teg caa- gac ggg taa egg tat ggc cga ctt tgc ttc 



His Ser Gin Asp Gly 
150 



Arg Tyr Gly Arg Leu Cys Phe 
l 155 

i ! 



145 

tct etc tac 
Ser Leu Tyr 
|160 



gec etc caa gee tgt ggc cga ggt cca gtc gac cga aga ttt caa cct 
Ala Leu Gin* Ala Cys Gly Arg Glir Prk Val Asp Arg Arg 
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Phe Gin Pro 



ata ctg tea tta cgt cgc tgg act cgt egg cga ggg act j etc ccg act 
He Leu Ser Leu Arg Arg Trp Thr Arg Arg Arg Gly Thr (Leu Pro Thr 

: 190 I 
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185 



ctt tgt cgc gac cga gaa gga acg ace att ctt ggc caa! c ca gat ggt 
Leu Cys Arg Asp Arg Glu Gly Th!r T*>r Xle Leu Gly Glnj Pro Asp Gly; 
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act ttc aaa etc gtt egg act cct; tec; cca aaa gac aaa cat cct teg 

Thr Phe Lys Leu Val Arg Thr Pro' Sei| Pro Lys Asp Lys His Pro Ser 
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i 220 



225 



aga tat teg gga gga cgc cga cgaj ag^ ccg tgg ctt ctg gec aag aga 
Arg Tyr Ser Gly Gly Arg Arg Arg Arg Ser Trp Leu Leu Kla Lys Arg 
230 | 235- jLo 

gat ctg ggc caa ccc gat eta tac' tg<? gca tgc acc ggg Lc aag gtt 
Asp Leu Gly Gin Pro Asp Leu Tyr Cys Ala Cys Thr Gly His Lys Val 
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255 



taa etc gtt! gac tga cct ggt cacj ga4 aga aaa cat cga caa agg ate 
Leu Val Asp Pro Gly Gl*i Glu Arg Lys His Arg Sin Arg lie 
260 j 265 [270 

1 i ■ 1 

eat gtg ggt gtt gag tgc gat gae act cga cgc gat cac teca tac tac 
Asn val Gly Val Glu Cys Asp Asp Th£ Arg Arg Asp His Ipro Tyr Tyr 
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cga cgc act! gga eta cct etc act tct aaa gaa cca gag jtgt ttt 
Arg Arg Thr Gly Leu Pro Leu Thr Seir Lys Glu Pro Glu [Cys Phe Gin 



290 



295 



300 



1 $ 2 
ctt ttg tgc^ tat ccc ggc tgt cat gtc gat tgc aac gtt fgga get atg 

Leu Leu Cys« Tyr Pro Gly Cys His Vail Asp Cys * Asn Val bly Ala Met 

305 • 310 j \ 315 I 320 



ctt cat gaa* ccc age ggt gtt cck acg aaa cat aaa aat jcag aaa ggg 
Leu His Glu Pro Ser Gly Val pro Thir Lys His Lys Asn )Gln Lys Gly 
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330 



aga age cgt; cga g gtgcgttcgc ghgtttetgtt tctacctttc ataacattgg 
Arg Ser Arg Arg ! 

• 340 " : 

aggttcttga etcttaageg tcttccaa'tc 
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tttttgcttt cctcttgttt etttteggeg tigattcaatc cag ct 



t|gatgcctce aattatcatc atttttgtct 
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cat tat gaa 
AlajHis Tyr Glu 
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gtg caa caa ccc teg gga ggt ggc; ata! cat gtt tag aga tta tgc teg 



val Gin Gin Pro Ser Gly Gly Glyj lie) His Val 
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Arg Leu Cys * Ser 
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aaa gat tea tgc caa ggc tat tec; tac; aga tec taa ctt cat caa gtt 
Lys Asp Ser Cys Gin Gly Tyr Serl Tyr Arg Ser lieu ftis Gin Val 

360 . ! • 365 I" 5 370 
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gag cgt tgc gtg tgg teg a gtgagttgdt cgategatcc atettttgtt 3386 
Glu Arg Cys Val Trp Ser \ \ 

375 . i 380 \ \ 
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ttgatcatcg cgagacttga ctgatcgat-t actcaaaaca tcatcgcttc teettcttgc 3446 
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tctctag at cga aca atg ggc tga; gca c tgtatgttcc tccgcccctc 



Asn «Arg Thr Met Gly 
I 385 
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cttcaagttt cctctcgctt catctttgtft gagaagaggg atctgatgta tctttctttg 



ttcggatcag ac ta ccc etc att dat gat gat teg gee ttc gaa tga ccc 



Leu Pro Leu He Tyr Asp Asp Ser Ala Phfe Glu- 



l?ro 



390 



395 



tea aaa ccc cgc acc etc aac ggc gc£ tga ccc ttt etc jagg aga cgc 
Ser Lys Pro Arg Thr Leu Asn Gly Ala Pro Phe Leu 'Arg Arg Arg 
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teg ttt aeg gac age etc taa gat ggc tga gat car; cc,c |cgc tgc tct 



Ser Phe Lys Asp Ser Leu 
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Glu Gly 
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Asp His Arg [Arg Cys Ser 
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3494 

3554 
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tgt cag gaa gaa age ccg gga tea cgc taa gtg gag aga ;gtc caa ggg 
Cys Gin Glu Glu Ser Pro Gly Ser Arb Val Glu Arg (val Gin Gly 
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440 



att gee tec gag cga tec gaa caa gee gga caa etc ggajgga tgc taa 
He Ala Ser Glu Arg Ser Glu Gin Ala Gly Gin Leu GlyJGly Cys 
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45*0 
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ttg ggt att gat egg egg tat ga : t. cgt tgg att gtt get i cgt gat ggg 
Leu Gly : lle Asp Arg Arg Tyr Asp Arg Trp lie Val Ala ' 
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cgc get egg ctt ggc cat cgc teg; ggd tgt tct tea g gtgcgttctt 
Arg Ala Arg Phe Gly Tyr Arg Leu! Gly Cys Ser Ser 
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ecaaagagee tttctctcat gaacacgcac ataggtcgat ctaattctat cncactctgt 



catacag tt 
Val 



ega gca ata a tctcaagatt ctagtccatc ctttcgctca 



Ala lie 



acgatctgcc tcttctcctt ctccttctcjc gtcttctctg gctcctttic tcactttctg 



r 

2tt 



accgatcctlt cacccgatct tggcgaaaat ct^aagcaaag agaatcacpt ttgtctaata 
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aaatttcctt taaagagtcg gcxccttctt gtggcgaagc ttcateecgt cttcetctgg 
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accatctctit ctcaatattc tttgtgctac tatatgatca agttctttga aatcaaagaa 



gaacatgtat ttgattttga ggttccaaga atacaacegg cccaagtcgt tettegcagt 
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tttcaccaga cageacatat ctctcctcct c^ctatagaa gccgtatggg gccaatcgac 
tctcatgggt agaccgtgcc cttttgacae ggggagaaag agaacgaa^ag gacactcgac 

< ■ ! j 

egattegtea ataaagcegt ccccaccttt tctttaatgg caatccaaga agagaaaaac 

: r 



aacccctgcg cgcacccgag tagecgatea gaccttccga acgacagata teatttgetg 4600 

: a 

; s t 

aaatcgaccg gattttaaag ctgctgccag gtcggtgaat ccccctaggt gatctcctcg 

! • i t 

I I 

tacaaagatg ttgggcacgg acttttcgac ceggatgaga acgtcgtgaa gagtttgaaa 
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aagattatca acataaegtg tetctttttc tjtttttcttt cgtaactctc tagagaacga 
ggagaegtae ggtctgattt gttatcg j j 
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tac etc gtt; ctg get ttc acg cac cct gec gat 
Tyr Leu Val| Leu' Ala Phe Thr His Ala Asp 



i i 



10 



i 



15 



48 



ctg cga get tta atg 
10 Leu Arg Ala Leu Met 

: 20 

ate act gca cag gag 
He Thr Xla Gin Glu 
15 35 



cag tac gcg ate tgg cat gag ccc pga agg aat 



Gin Tyr Ala; He Trp His Glu Pro Arg Arg Asn 

( 25 f 30 



gaa cat gca aca tec ggt tgg gac eg; 



a gaa act 

Glu His Kid Thr Ser Gly Trp Asp Arg Glu Thr 

40 | I 45 j 



96 



144 



20 



atg aag gaatgt tgg aag tat 
Met Lys Glu Cys Trp Lys Tyr 
50 55 

get gtc ate aaa gag ttg gac 
Ala Val ile'Lys Glu Leu Asp 
65 70 



ttg 
Leu 
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ctg act tea aga 
Leu Thr Ser Arg 
60 



=igt ttc 
Ser Phe 



gg4 
Gly 
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gat 
Asp 



ctt acc cga gtc |atc tgt 
Leu Thr Arg Valine Cys 
75 



gca 
Ala 



tta 
Leu 
80 



192 



240 



25 ttc tat etc get ctt cga gga 
Phe Tyr Leu «Ala Leu Arg Gly 
85 

eta tct aat ; gat gtg aag ctt 
30 Leu Ser Asnj Asp Val Lys Leu 
100 



ctg gat acc att gag gat (gac atg 
Thr He Glu Asp [Asp Met 
90 I 95 



Leu 

\ 

I 
| 

ccq 

Pro 

j 

I 



Asp 



cct gac tec- cct ggg tgg acc 
35 Leu Asp Ser Pro Gly Trp Thr 

. ii5' 

( 

gat aga gag ctt ctt gtt cac 
Asp Arg * Glu Leu Leu Val His 
40 130 t ' 135 

aac ttg. gac gtc aac tct egg 
Asn Leu Asp Val Asn Ser Arg 
145 150 



ttt 
Phk 
120 



I 

ctg 
Leu 
105 

i 

i 

4 
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ac, r t 
Thr 



ctt egg aca ttc 
Leu Arg Thr Phe 



gga tec ggt cca 
Gly Ser Gly Pro 
125 



tgg gaa 
;Trp Glu 
110 



aat gag 
Asn Glu 



ttp 
Piie 
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i 

aa'c 

Asn 

f 
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g^t 

Asp 

! 

! 
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t 

gt£c 
Val 



agt 
Ser 



aag 
Lys 



288 
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gtg gec ate gec 5 gag ttt 
Val Ala He Ala j Glu Phe 
140 



att cga gac ate 
lie Arg Asp He 
155 



act cgc 
Thr Arg 



aag 
Lys 



gec 
Ala 



aag 
Lys 
160 
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atg ggt aac ggt atg gcc gac etc! get; tec etc tct acg ccc tec aag 

Met Gly Asn Gly Met Ala Asp Phej Ala; Ser Leu Ser Thr Pro Ser Lys 

165 ; \ 170 175 
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ccc gtg gcc gag gtc cag teg acc; gaa; gat etc aac eta iac tgt cat 
Pro Val Ala Glu Val Gin Ser Thr, Glu Asp Phe Asn Leu Tyr Cys His 



180 



185 
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tac gtc get gga etc gtc ggc gag gga etc tec cga etc t:tt gtc gcg 
Tyr Val Ala .Gly Leu Val Gly Glu Glj Leu Ser Arg Leu Phe Val Ala 
* 195 20d \ 205 



acc gag aag gaa cga cca ttc teg gcc aac cag atg gta 



:ct tea aac 



Thr Glu Iiys Glu Arg Pro Phe Leu Al 4 Asn Gln . Met P eu Ser Asn 

210 : 215 ? | 220 

• M 

teg ttc gga etc ctt etc caa aag aca aac ate ctt cga gat att egg 
Ser Phe Gly : Leu Leu Leu Gin Lys Thr Asn He Leu Arg Asp lie Arg 
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i ! 
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gag gac gcc gac gaa ggt cgt gg<^ ttc tgg cca aga gag ate tgg gcc 
Glu Asp Ala Asp Glu Gly Arg Gl^ Phe Trp Pro Arg Glu ille Trp Ala 
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255 
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aac ccg ate tat act gcg cat gca ccg ggc aca agg ttt :aac teg ttg 

' * i 

Asn Pro He Tyr Thr Ala His Ala Pro Gly Thr Arg Phe [Asn Ser Leu 

260 • 265 f270 
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11 \ 

act gac ctg gtc aag aaa gaa aac ate gac aaa gga tea jatg tgg gtg 

Thr Asp Leu val Lys Lys Glu Asn He Asp Lys Gly Ser [Met Trp val 

275. 280 J 285 \ 

ttg agt gcg atg aca etc gac gcg ate ace cat act acc jgac gca ctg 

Leu Ser Ala Met Thr Leu Asp Ala He Thr His Thr Thr jAsp Ala Leu 

1 - 295 ! \ 
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3 00 



gac tac etc tea ctt eta aag aac cag agt gtt ttc aac J ttt tgt get 
Asp Tyr Leu Ser Leu Leu Lys Ash Gin Ser Val Phe Asn \ Phe Cys Ala 
305 1 310 | { 315 [ 320 



ate ccg get 
He Pro Ala 
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gtc atg teg att gca acg ttg gag eta tgc5ttc atg aac 



Val Met Ser lie Ala. Thr Leu Glu Leu Cye 
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Phe Met Asn 
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cca gcg gcg ;ttc 
Pro Ala Val Phe 
340 

5 gag etc att atg 

i i 

Glu Leu lie Met 
355 
i 

aga gat tat ,gct 
10 Arg Asp Tyr Ala 
370 



caa cga aac 
Gin Arg Asn 



at a: aaa* 



lie! 



aag tgc aac 
Lys Cys Asn 



cga aag att 
Arg Lys Xle 
375 



aae ( 
Asn) 

3 60' 

I 
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cat- 
His' 



Lys 

i 

345' 



cct" 

Pro 

2 



gcc 
AlJ 



ate 
lie 



egg 
Arg 



aag 
Lys 



aga aag gga gaa gcc gtc 
Arg Lys Gly Glu Ala val 

i 

gag gtg gca cac atg ttt 
Glu Val Ala yyr Met Phe 
365 



1056 



get att 
Ala He 
380 



cct aca gat cct 
Pro Thr Asp Pro 



1104 



1152 



aac etc ate aag 
Asn Phe He .Lys 
15 305 



gag cac tac • tac 
Glu His Tyr Tyr 
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25 



30 



35 



caa aac ccc gca 
Gin Asn Pro. Ala 
; 420 

cgt tea agg ; ata 
Arg Leu Arg* He 
435* 

gtc agg aag aaa 
val Arg Lys. Lys 

450 i 
ttg cct ccg age 
Leu Pro Pro Ser 
465 



ttg age gtt 
Leu Ser Val 
390 

ccc tea ttt 
Pro Ser Phe 
405 

ccc tea acg 
Pro Ser Thr 
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gc 
Aid 



atg 



tgt 
Cyi 



atg 



Met Met 



gcg 
Ala. 
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ggt cga ate gaa caa tgg get 
Gly Arg He Glu pin Trp Ala 
395 | 400 



att egg cct teg aat gac cct 
He Arg Pro Ser Asn Asp Pro 
410 \ 415 



1200 



gcc tct aag 
Ala Ser Lys 



gcc egg gat 
Ala Arg Asp 
455 

gat ccg aac 
Asp Pro Asn 
470 



tgg gta ctg ate ggc ggt atg 
Trp Val Leu He Gly Gly Met 
! 485 



40 gtg etc ggt ttg 
Val Leu : Glir Leu 
1 500 



get ate get 
Ala He Ala 



aag 
Lys 
440 
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cac 
. ! 
His 
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aag 

Lys 
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ate 
Ilk 

i 
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Trp 
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ctt 
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Leu 

425 

\ 

\ 

get 
Ala 

5 
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get 
Alk 



gac cct ttc tea gga gac get 
Asp Pro Phe Ser IGly Asp Ala 
|430 

gag ate acc gcc |gct get ctt 
Glu lie Thr Ala &la Ala Leu 
445 | 

r 

gag ?tcc aag gga 
Glu jser Lys Gly 



ccg 
f 

Pro 



aag 
Lys 

gac 
Asp 



gtjt 
val 



! 

gt!t 

val 
505 



gga 
Gly 
490 

gtt 
Val 



tgg aga 
Trp Arg 
460 
aac teg 
Asn Ser 
475 

ttg ttg 
Leu Leu 



gag jgat gtt aat 
GlujAsp Val Asn 



480 



ctt cag 
Leu Gin 



etc] gtg atg ggc 
Leu 1 Val Met Gly 
I . 495 

ttt! gag caa taa 
Phe | Glu Gin 
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Met Gly lie Ser Asp Tyr Leu Val; Le\i Ala Phe Thr His Pro Ala Asp 

l : . 5 i 



i 10 



Leu Arg Ala Leu Met Gin Tyr Alai He. Trp. His Glu Pro Arg Arg A9n 



10 



.20 



! 25| 
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He Thr Ala Gin Glu Glu His Ala Thr Ser Gly Trp Asp Arg Glu Thr 
35 40 I J 45 J 



40 I 
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15 Met Lys Glu. Cys Trp Lys Tyr.Leil As£ Leu Thr Ser Arg ker Phe Ala 



50 



55 



60 



Ala Val lie Lys Glu Leu Asp Gl^ Asp Leu Thr Arg Val [lie Cys Leu 
65 70 ' j I 75 ! 
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Phe Tyr Leu Ala Leu Arg Gly Lexi Asp Thr He Glu Asp ?Asp Met 
5 85 I . i 90 i 95 



80 
Ser 



I 



25 



Leu Ser Asn Asp Val Lys Leu Pro Leu Leu Arg Thr Phe -Trp Glu Lys 
; 100 | 105 jlio 



Leu Asp Ser Pro Gly Trp Thr Phe Thr Gly Ser Gly Pro 

US' 120 ! 125 

i I 

"S \ 

30 Asp Arg Glu. Leu Leu Val His Phe As£j Val Ala He Ala 
130 ! 135 i ) 140 

: i 

Asn. Leu Asp Val Asn. Ser Arg Asn Val lie Arg Asp He 
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Met Gly Asn Gly Met Ala Asp Phe Alja Ser Leu Ser Thr 
165 I | 170 
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Pro Val ;Ala Glu Val Gin Ser Thr Gl;u Asp Phe Asn Leu 

• • 180 • 



Asrv Glu Lys 



Glu Phe Ala 



Thr Arg Lys 
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Pro Ser Lys 
175 " 

Tyr Cys His 
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! 185 
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Tyr Val 'Ala Gly Leu Val Gly Glu Gly Leu Ser Arg Leu? Phe Val Ala 
.195 200 j 205' 
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Thr Glu Lys Glu Arg Pro Ph.© r»eu= Ala*; Asn Gin .Met Val Leu Ser Asn 
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Ser Phe Gly Leu Leu Leu Gin Lys; Thr> Asn lie Leu Arg 
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Glu Asp Ala Asp Glu Gly Arg Gly; Phe Trp Pro Arg Glu lie Trp Ala 
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p lie Arg 
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10 Asn Pro lie Tyr Thr Ala His Ala Pro Gly Thr Arg Phe Asn Ser Leu 
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! 265 
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Thr Asp Leu Val Lys Lys Glu Asii lie Asp Lys Gly Ser Met Trp val 
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280' 



15 



285 



Leu Ser Ala Met Thr Leu Asp Ala! lie Thr His Thr Thr Asp Ala Leu 
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300 



20 305 



Asp Tyr Leu: Ser Leu Leu Lys Asri Glii-Ser Val Phe Asn Phe Cys Ala 

310 ! J 315 320 



lie Pro Ala .Val Met Ser lie A14 Thr Leu Glu Leu Cys fhe Met Asn 

325 j ? ? 
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335 
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25 Pro Ala Val Phe Gin Arg Asn lie Lys He Arg Lys Gly jSlu Ala Val 

1 340 I 34$ f350 
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Glu Leu lie Met Lys Cys Asn Asn Pro Arg Glu Val Ala !Tyr Met Phe 
30 355. 360 ( 365 | 

- M l 

Arg Asp Tyr* Ala Arg Lys He His Ala Lys Ala He Pro >Thr Asp Pro 

1 330 \ 



370 



375 



35 Asn Phe lie; Lys Leu Ser Val Ala Cyf Gly Arg lie Glu 
385 390 \ I 395 



Glu His Tyr Tyr Pro ser Phe Met Me!t He Arg Pro Ser 
405 ! j 410 • 

40 j 



Gin Trp Ala 

400 

Asn Asp pro 
415 



Gin Asn Pro Ala Pro Ser Thr Al?a Leu Asp Pro Phe Ser? Gly Asp Ala 
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Arg Leu Arg He Ala Ser Lys Lyel Ala- Glu He tfhr Ala Ala Ala Leu 
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Val Arg Lys Lys Ala Arg Asp Hisj Alaj Lys Trp Arg Glu 
450 ; 455 j J 460 



Ser Lys Gly 



Leu Pro Pro $er Asp Pro Aen Lys; Pro Asp Asit Ser 
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10 Trp Val Leu :lle Gly Gly Met lie? Val' Gly Leu Leu 
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Glu Asp Val Aan 
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Leu yal Met Gly 
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